Chimaeric Surrogate Light Chains (SLC) Comprising Human VpreB

ABSTRACT

The present invention relates inter alia to improvements in the production of chimaeric antibodies in non-human transgenic vertebrates such as mice and rats bearing one or more chimaeric antibody transgenes. In particular, the invention provides for improved non-human vertebrates and cells in which VpreB has been species-matched with the variable region of the chimaeric antibodies. Also, embodiments also provide for species-matching of the entire surrogate light chain for efficient pairing with chimaeric heavy chains during B-cell development in vivo in a non-human transgenic vertebrate setting.

This application is a continuation of U.S. application Ser. No. 14/226,698, filed Mar. 26, 2014, which is a continuation of PCT/GB2012/052380, filed Sep. 26, 2012, which claims the benefit of GB 1116495.1, filed Sep. 26, 2011, and GB 1120423.7, filed Nov. 28, 2011, the contents of each which are incorporated by reference herein in their entirety. The attached sequence listing is hereby incorporated by reference.

The present invention relates inter alia to improvements in the production of chimaeric antibodies in non-human transgenic vertebrates such as mice and rats bearing one or more chimaeric antibody transgenes. In particular, the invention provides for improved non-human vertebrates and cells in which VpreB has been species-matched with the variable region of the chimaeric antibodies. Also, embodiments provide for species-matching of the entire surrogate light chain for efficient pairing with chimaeric heavy chains during B-cell development in vivo in a non-human transgenic vertebrate setting.

Furthermore, the invention relates to a method of using the vertebrates to isolate antibodies or nucleotide sequences encoding antibodies. Antibodies, nucleotide sequences, pharmaceutical compositions and uses are also provided by the invention.

BACKGROUND

Advancements in the construction of transgenic mice bearing human antibody gene loci have led researchers to move away from the provision of entirely human transgenic antibody loci (bearing human variable and constant region gene segments) to chimaeric transgenic antibody loci. The transgenic loci comprise human V(D)J segments operably connected upstream of non-human constant regions. The use of constant regions that are endogenous to the transgenic non-human vertebrate (eg, endogenous mouse or rat constant regions) are desirable to harness the endogenous control of antibody generation and maturation following immunisation of the vertebrate with antigen.

B-cell development is characterized by the ordered rearrangement of immunoglobulin variable region genes. After the VDJ rearrangement of the H chain gene segments, a precursor (pre)-B-cell is generated. After the functional rearrangement of a light chain gene, the cells develop into surface IgM-bearing mature B-cells. Fully assembled IgH and L chains are transported onto the surface of B-cells, while free H chains are retained in the endoplasmic reticulum (ER) in association with BiP.

A critical step in B-cell differentiation is the selective expansion of cells with a functional μ heavy chain resulting from productive IgH rearrangement. This is achieved by the association of the μ heavy chain with the surrogate light chain (SLC) proteins λ5 and VpreB and the signal transducing heterodimer Igα and β to form the pre-B-cell receptor (pre-BCR). The expression and formation of a pre-BCR dramatically improves the efficiency of pre-B and B-cell production, by signalling proliferative expansion of pre-B-cells. The major function of the pre-BCR is the selection and expansion of cells that have undergone a productive VDJ rearrangement. The expression of membrane-bound μ chains is essential for the clonal expansion, and initiation of L chain gene rearrangement.

The association of SLC with μ chain works as a checkpoint to determine whether the cell has successfully completed VDJ combination and expresses a functional μH chain. More than half of all μ chains cannot assemble with the SLC, accounting for most of the changes in the V heavy repertoire during B-cell development.

VpreB is homologous to variable regions, λ5 is homologous to light chain constant region. The protein encoded by the two genes can form a tightly, but noncovalently bound heterodimer with the general structure of an L chain. λ5 protein can be covalently disulphide-bonded to μ H chains in pre-B-cells. Whereas VpreB alone can associate with Igμ, λ5 alone cannot. Only when it is non-covalently associated with VpreB does λ5 form a disulphide bridge with the first constant region (CH1) of an Igμ chain to form the pre-BCR.

The λ5 and VpreB, which together form the SLC, are early markers of B-cell commitment. They are expressed at the pro- and pre-B-cell stages and silenced in immature and mature B-cells. In pre-B-cells, following rearrangement of the heavy chain locus, the surrogate light chain (SLC) acts as a chaperone, mediating transport of the newly synthesised heavy chain mu to the cell surface and together with C mu forms part of the pre-BCR (Immunol Today. 1993 February; 14(2):60-8; The surrogate light chain in B-cell development; Melchers F, Karasuyama H, Haasner D, Bauer S, Kudo A, Sakaguchi N, Jameson B, Rolink A). The pre-BCR mediates signalling, leading to proliferation of pre-B-cells that have a productive heavy chain rearrangement. Mice that lack a functional λ5 show a drastic reduction in the number of B-cells. Using VpreB knock-out mice, it has been shown that VpreB is also required for efficient B-cell development, particularly for the transition to pre-BCR bearing cells (pre-611 stage) (see, Int Immunol. 1999 March; 11(3):453-60; Partial block in B lymphocyte development at the transition into the pre-B cell receptor stage in Vpre-B1-deficient mice; Mårtensson A, Argon Y, Melchers F, Dul J L, Mårtensson IL; and Sabbattini & Dillon 2005 infra). As the degree of proliferation has been proposed to be dependent upon the stability of the association of the mu heavy chain with the components of the SLC, the pre-BCR is thought to influence the choice, and thus the extent and diversity of, the heavy chain variable region repertoire.

Besides its role in signalling proliferation, the pre-BCR is also thought to mediate down regulation of the RAG genes, thereby preventing the rearrangement of other IgH loci (allelic exclusion) and the occurrence of double-strand breaks in dividing pre-B-cells. The pre-BCR has also been proposed to exert a negative feedback on λ5 and VpreB expression so that, after a phase of clonal expansion, the large pre-BII cells become depleted of pre-BCR, exit the cell cycle and differentiate into resting small pre-BII cells. At this stage, the SLC is repressed while the IgL loci start to be rearranged. The product of a successfully rearranged IgL gene will then pair with the mu heavy chain to form a BCR with antigen-binding capability on the surface of immature B-cells. These cells migrate to the peripheral blood and secondary lymphoid organs, and develop into mature B-cells ready for subsequent encounter with antigen (Sabbattini & Dillon 2005 infra).

SUMMARY OF THE INVENTION

The inventors were aware of the desirability to use endogenous non-human (eg, mouse or rat) constant regions to harness the endogenous control of antibody generation and maturation following immunisation. They have realised, however, that these considerations do not address earlier B-cell stages in vivo (prior to antigen stimulation) when the animal's B-cell repertoire is maturing. Such maturation includes transition from immature pro-B-cells to pre-B-cells, including the pre-BII stage when B-cells bear surface B-cell receptors (BCR) in which the heavy chain variable region is paired with the VpreB. CH1 regions of heavy chains are paired with λ5 in the pre-BCRs. The inventors realised that the provision of genes encoding chimaeric antibodies is not tailored to the pre-BII stage, and they realised that it is important to address this since this stage features huge clonal expansion required to provide a pool of B-cells for functional heavy chain selection during subsequent B-cell development.

To this end, the present invention provides for species-matching of human V regions in chimaeric heavy chains with human VpreB. In embodiments, the invention also provides for species-matching of the mu constant region with λ5, so that heavy chains in pre-BCRs are completely species-matched with chimaeric surrogate light chain (SLC).

Thus, a first configuration of the present invention provides,

A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse cell or rat cell) whose genome comprises an antibody heavy chain transgene,

the transgene comprising

(a) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region; or

(b) a rearranged VDJ encoding a human variable region operably connected upstream of a constant region gene so that the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a constant region;

the genome further comprising a human VpreB gene capable of expressing a human VpreB.

The present invention is applicable to the production of 4-chain antibodies in transgenic animals, where the antibodies each contain 2 heavy chains and 2 light chains. Alternatively, the invention can be applied to the production of H2 antibodies (heavy chain antibodies) which are devoid of CH1 and light chains.

In a second embodiment, the invention provides

A method of constructing a transgenic non-human vertebrate cell (eg, an ES cell, eg, a mouse or rat ES cell), the method comprising

(i) introducing into the genome of a non-human vertebrate cell (or an ancestor thereof) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments so that said gene segments are operably connected upstream of a constant region gene (optionally an endogenous non-human vertebrate constant region gene) to form a heavy chain transgene, wherein in said cell or a progeny thereof the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region; or

(ii) introducing into the genome of a non-human vertebrate cell (or an ancestor thereof) a rearranged VDJ encoding a human variable region so that said VDJ is operably connected upstream of a constant region gene (optionally an endogenous non-human vertebrate constant region gene) to form a heavy chain transgene, wherein in said cell or a progeny thereof the transgene encodes (optionally following VDJ combination with the constant region) an antibody heavy chain having a human variable region and a constant region;

and wherein the method further comprises introducing into the cell a human VpreB gene capable of expressing a human VpreB.

In a third embodiment, the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(a) providing in a non-human vertebrate embryonic stem (ES) cell genome an immunoglobulin transgene capable of expressing an antibody mu heavy chain, wherein the antibody heavy chain comprises a human variable region and a mu constant region (optionally an endogenous non-human vertebrate mu constant region); and creating a first non-human vertebrate from said ES cell or a progeny thereof;

(b) providing in a second ES cell genome a second transgene comprising a human VpreB gene capable of expressing a human VpreB; and creating a second non-human vertebrate from said ES cell or a progeny thereof; and

(c) creating by breeding a third non-human vertebrate capable of co-expressing the mu antibody heavy chain and human vpreB wherein a pre-B-cell receptor can form to promote B-cell development of cells bearing a mu heavy chain in said third vertebrate; the third vertebrate being made by crossing said first and second vertebrates or progeny thereof by breeding to create said third vertebrate, the third vertebrate comprising the first and second transgenes, and wherein endogenous heavy chain expression has been inactivated in said third vertebrate.

In a fourth embodiment, the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(i) inactivating endogenous heavy chain expression in said vertebrate;

(ii) providing in the genome of said vertebrate an immunoglobulin transgene capable of expressing an antibody mu heavy chain, wherein the antibody heavy chain comprises a human variable region and a non-human vertebrate mu constant region (optionally an endogenous non-human vertebrate mu constant region); and

(ii) providing in the genome of said vertebrate a second transgene capable of expressing a human VpreB wherein a pre-B-cell receptor can form to promote B-cell development of cells bearing a mu heavy chain in said vertebrate;

Optionally wherein the genome is homozygous for said first transgene.

An aspect of the invention further provides for the possibility of species-matching in vivo the variable and constant regions of the antibody heavy chain with a chimaeric SLC. This is useful for promoting B-cell development in the transgenic non-human vertebrate (eg, a mouse or rat).

To this end, the invention provides a fifth embodiment:—

A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse cell or rat cell) whose genome comprises an antibody heavy chain transgene,

the transgene comprising

(a) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene of a non-human vertebrate (eg, mouse or rat) species so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region; or

(b) a rearranged VDJ encoding a human variable region operably connected upstream of a constant region gene of a non-human vertebrate (eg, mouse or rat) species so that the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region;

the genome further comprising one or more genes together encoding a chimaeric surrogate light chain, the surrogate light chain comprising a human VpreB and λ5 of said non-human vertebrate (eg, mouse or rat) species.

A sixth embodiment of the invention provides

A method of constructing a transgenic non-human vertebrate cell (eg, an ES cell, eg, a mouse or rat ES cell), the method comprising

(i) introducing into a non-human vertebrate cell (or an ancestor thereof) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments so that said gene segments are operably connected upstream of an endogenous non-human vertebrate constant region gene, wherein in said cell or a progeny thereof the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region; or

(ii) introducing into a non-human vertebrate cell (or an ancestor thereof) a rearranged VDJ encoding a human variable region so that said VDJ is operably connected upstream of an endogenous non-human vertebrate constant region gene, wherein in said cell or a progeny thereof the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region;

and wherein the method further comprises introducing into the cell a human VpreB gene such that the cell or a progeny thereof is capable of expressing a chimaeric surrogate light chain as well as said chimaeric antibody heavy chain, wherein the surrogate light chain comprises a human VpreB and an endogenous non-human vertebrate λ5.

A seventh embodiment of the invention further provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(a) providing in a non-human vertebrate embryonic stem (ES) cell an immunoglobulin transgene capable of expressing a chimaeric antibody mu heavy chain, wherein the chimaeric antibody heavy chain comprises a human variable region and an endogenous non-human vertebrate mu constant region; and creating a first non-human vertebrate from said ES cell or a progeny thereof;

(b) providing in a second ES cell a second transgene capable of expressing a human VpreB so that the human VpreB and an endogenous λ5 form a chimaeric surrogate light chain; and creating a second non-human vertebrate from said ES cell or a progeny thereof; and

(c) creating by breeding a third non-human vertebrate capable of co-expressing the chimaeric mu antibody heavy chain and chimaeric surrogate light chain wherein a pre-BCR can form to promote B-cell development in said third vertebrate; the third vertebrate being made by crossing said first and second vertebrates or progeny thereof by breeding to create said third vertebrate, the third vertebrate comprising the first and second transgenes, and wherein endogenous heavy chain expression has been inactivated in said third vertebrate.

A transgenic mouse according to the vertebrate of this method, or a progeny thereof, is provided.

An eighth embodiment of the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(i) inactivating endogenous heavy chain expression in said vertebrate;

(ii) providing in the genome of said vertebrate an immunoglobulin transgene capable of expressing a chimaeric antibody mu heavy chain, wherein the chimaeric antibody heavy chain comprises a human variable region and an endogenous non-human vertebrate mu constant region; and

(ii) providing in the genome of said vertebrate a second transgene capable of expressing a human VpreB so that the human VpreB and an endogenous λ5 form a chimaeric surrogate light chain; so that the vertebrate is capable of co-expressing the chimaeric mu antibody heavy chain and chimaeric surrogate light chain wherein a pre-BCR can form to promote B-cell development in said vertebrate;

Optionally wherein the genome is homozygous for said first transgene.

A transgenic mouse according to the vertebrate of this method, or a progeny thereof, is provided.

BRIEF DESCRIPTION OF THE FIGURES

FIGS. 1A & B: Schematic representation of the steps involved in replacement of the mouse VpreB1 with human VpreB1. The most 5′ gene is Topo3β.

FIG. 2: Upper Schematic: representation of vector for retrieval of the 19 kb mouse DNA sequence containing the human VpreB1 gene; Lower Schematic:representation of the vector containing the retrieved 19 kb mouse DNA sequence containing the human VpreB1 gene. Black rectangular boxes represent homology arms. Black arrows represent the coding sequences of VpreB1 and mouse λ5.

FIGS. 3A & B: Schematic maps of insert (FIG. 3A) a human BAC (FIG. 3B) containing correctly targeted human VpreB1, λ5 and mouse regulatory elements.

FIG. 4: Schematic representation of the steps involved in replacement of the mouse λ5 with human λ5.

DETAILED DESCRIPTION OF THE INVENTION

The invention is useful for promoting the appropriate human IGH repertoire on mature B-cells in the vertebrate (or progeny thereof) or in transgenic vertebrates derived from non-human vertebrate cells. At the transition of pre-BI to pre-BII cells, VH to DH-JH rearrangements are initiated (see, eg, Mårtensson & Ceredig, Immunology 2000, 101, 435-441; Role of the surrogate light chain and the pre-B-cell receptor in mouse B-cell development). Whenever they occur in-frame, a mu chain can be produced and presented on the cell surface only if it associates with the surrogate light chain to form the pre-B-cell receptor (pre-BCR). The pre-BCR on the cell surface is required for clonal expansion of pre-BII cells before light chain rearrangement. Compromised presentation of pre-BCR on the cell surface due to the impairment of association of a mu chain carrying a certain variable region with a surrogate light chain will result in no clonal expansion of such a clone and reduce the representation of this variable region in the mature B cells. Human VpreB1 only shares a low identity with rodent VpreB1/2, suggesting that species specificity for the pre-BCR assembly exists. To this end, the inventors realised the importance for appropriate VpreB/variable region pairing on generation of effective antibody diversity in the context of an antibody heavy chain transgene encoding human variable regions, when this transgene is harboured in a non-human environment such as a transgenic mouse or rat.

The CH1 constant region pairs with the λ5 protein. Together, the VpreB and λ5 make up the surrogate light chain (SLC) that pairs with mu heavy chains to form the pre-BCR. For H2 antibodies pairing with the CH1 is not possible (since no CH1 is present), but the desirability to pair human VH regions with human VpreB according to the invention still applies.

Pairing Human Variable Regions of Pre-BCRs with Human VpreB

The extent of homology between human VpreB with mouse and rat VpreB is relatively low (about 73% amino acid identity). As discussed above, the inventors have realised the advantage of matching human antibody heavy chain variable regions with human VpreB in receptors of pre-B cells for promoting B-cell development in transgenic non-human vertebrates bearing human antibody variable region genes.

In a first configuration, the invention thus provides:—

A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse cell or rat cell) whose genome comprises an antibody heavy chain transgene, the transgene comprising

(a) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region; or

(b) a rearranged VDJ encoding a human variable region operably connected upstream of a constant region gene so that the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a constant region;

the genome further comprising a human VpreB gene capable of expressing a human VpreB.

The sequences of VpreB's and λ5's are given below:—

SEQ ID NO: 1

Human VpreB1 nucleotide sequence (GenBank Accession No=NG_029387.1)

SEQ ID NO: 2

Human VpreB1 amino acid sequence (Gen Bank Accession No=NP_009059.1)

SEQ ID NO: 3

Human VpreB3 nucleotide sequence (GenBank Accession No=NC_000022.10)

SEQ ID NO: 4

Human VpreB3 amino acid sequence (Gen Bank Accession No=NP_037510)

SEQ ID NO: 5

Mouse VpreB1 nucleotide sequence (Gen Bank Accession No=NC_000082.5)

SEQ ID NO: 6

Mouse VpreB1 amino acid sequence (GenBank Accession No=NP_058678)

SEQ ID NO: 7

Mouse VpreB2 nucleotide sequence (Gen Bank Accession No=NC_000082)

SEQ ID NO: 8

Mouse VpreB2 amino acid sequence (GenBank Accession No=NP_058679.1)

SEQ ID NO: 9

Rat VpreB1 nucleotide sequence (Gen Bank Accession No=NM_001108845.1)

SEQ ID NO: 10

Rat VpreB1 amino acid sequence (GenBank Accession No=NP_001102315.1)

SEQ ID NO: 11

Rat VpreB2 nucleotide sequence (Gen Bank Accession No=NC_005110)

SEQ ID NO: 12

Rat VpreB2 amino acid sequence (GenBank Accession No=NP_001128260)

SEQ ID NO: 13

Human λ5 nucleotide sequence (GenBank Accession No=NG_009791)

SEQ ID NO: 14

Human λ5 amino acid sequence (GenBank Accession No=NP_064455)

SEQ ID NO: 15

Mouse λ5 nucleotide sequence (Gen Bank Accession No=AC_000038)

SEQ ID NO: 16

Mouse λ5 amino acid sequence (Gen Bank Accession No=NP_001177254)

SEQ ID NO: 17

Rat λ5 nucleotide sequence (Gen Bank Accession No=NC_005110)

SEQ ID NO: 18

Rat λ5 amino acid sequence (Gen Bank Accession No=NP_001177270)

All sequences listed under these GenBank Accession numbers are incorporated herein by reference as though explicitly written herein and for possible use in the present invention and possible inclusion of any claim below.

In one embodiment of any configuration of the invention, the human VpreB is human VpreB1. In another embodiment, the human VpreB is human VpreB3.

TABLE 1 Sequence Identity: VpreB: Human vs. Mouse vs. Rat Human Mouse Mouse Rat Rat VpreB1 VpreB1 VpreB2 VpreB1 VpreB2 Human VpreB1 72.5 72.5 72.6 72.6 Mouse VpreB1 97.2 84.4 83.7 Mouse VpreB2 86.7 85.9 Rat VpreB1 99.3 Rat VpreB2

TABLE 2 Sequence Identity: Lambda 5: Human vs. Mouse vs. Rat Human λ5 Mouse λ5 Rat λ5 Human λ5 55.8 57.2 Mouse λ5 90.4 Rat λ5

All nucleotide co-ordinates for the mouse are those corresponding to NCBI m37 for the mouse C57BL/6J strain, e.g. April 2007 ENSEMBL Release 55.37h, e.g. NCBI37 July 2007 (NCBI build 37) (e.g. UCSC version mm9 see www.genome.ucsc.edu and http://genome.ucsc.edu/FAQ/FAQreleases.html) unless otherwise specified. Human nucleotides coordinates are those corresponding to GRCh37 (e.g. UCSC version hg 19, http://genome.ucsc.edu/FAQ/FAQreleases.html), February 2009 ENSEMBL Release 55. or are those corresponding to NCBI36, Ensemble release 54 unless otherwise specified. Rat nucleotides are those corresponding to RGSC 3.4 Dec. 2004 ENSEMBL release 55.34w, or Baylor College of Medicine HGSC v3.4 Nov. 2004 (e.g., UCSC rn4, see www.genome.ucsc.edu and http://genome.ucsc.edu/FAQ/FAQreleases.html) unless otherwise specified.

In one example of the vertebrate or cell, the genome is homozygous for said transgene and endogenous non-human vertebrate antibody heavy chain expression has been inactivated. Alternatively or additionally, endogenous non-human vertebrate antibody light chain expression has been inactivated. Details on possible methods of inactivation are explained in more detail below.

The present invention is applicable to the production of 4-chain antibodies in transgenic animals, where the antibodies each contain 2 heavy chains and 2 light chains. Alternatively, the invention can be applied to the production of H2 antibodies (heavy chain antibodies) which are devoid of CH1 and light chains (see, eg, Nature. 1993 Jun. 3; 363(6428):446-8; Naturally occurring antibodies devoid of light chains; Hamers-Casterman C, Atarhouch T, Muyldermans S, Robinson G, Hamers C, Songa E B, Bendahman N, Hamers R). These antibodies function to specifically bind antigen, such antibodies being found in the blood of Camelidae (eg, llamas, camels, alpacas). Such antibodies with VH pairs can also be synthetically produced to provide therapeutic and prophylactic medicaments (eg, see WO1994004678, WO2004041862, WO2004041863). Transgenic mice also can produce such heavy chain antibodies and the in vivo production of the antibodies allows the mouse's immune system to select for human VH-VH pairings, sometimes selecting for such pairings in which mutations have been introduced in vivo by the mouse to accommodate the pairing (WO2010109165A2). Thus, in an embodiment of the present invention, the heavy chain transgene is devoid of a CH1 gene segment and the genome comprises no functional antibody light chain locus.

Throughout this text, and with application to any configuration, aspect, embodiment or example of the invention, the term “endogenous” (eg, endogenous constant region gene or endogenous λ5) in relation to a non-human vertebrate or cell indicates that the constant region, λ5 etc is a type of constant region or λ5 that is normally found in the vertebrate or cell (as opposed to an exogenous constant region or λ5 whose sequence is not normally found in such a vertebrate or cell). For example, the endogenous constant region and λ5 can be those encoded by the wild-type genome of the non-human vertebrate/cell. So, in an example wherein the vertebrate cell is a mouse ES cell, the endogenous constant region and λ5 would be mouse constant region and λ5. Going further, the endogenous regions are, in an example, strain-matched to the vertebrate/cell. So, in one embodiment, the vertebrate cell is a mouse 129 ES cell, the endogenous constant region and λ5 would be mouse 129 constant region and λ5. In another embodiment, the vertebrate cell is a mouse JM8 ES cell, the endogenous constant region and λ5 would be mouse JM8 constant region and λ5. In another embodiment, the vertebrate is a Black 6 mouse, the endogenous constant region and λ5 would be mouse Black 6 constant region and λ5.

In any configuration, aspect, embodiment or example of the invention, the constant region of the heavy chain transgene is a non-human vertebrate constant region (eg, mouse or rat constant region). Optionally, the constant region is endogenous to said non-human vertebrate or cell. Alternatively, the constant region of the heavy chain transgene is human constant region. For example, the constant region is human and devoid of a CH1. This is useful for producing human H2 antibodies (especially when the vertebrate or cell is not capable of expressing light chains).

In one example of the vertebrate or cell of the invention, the constant region is a Cmu, eg, a mouse or rat Cmu. For example, where the vertebrate is a mouse (or cell is a mouse cell), the Cmu is an endogenous mouse Cmu. The transgene, in an example, comprises a Smu switch 5′ of the Cmu and a Cgamma 3′ of the Cmu, with a S gamma switch between the Cmu and Cgamma. In an embodiment, the Cmu, Cgamma and switches are endogenous mouse C regions and switches. For example, the C regions and switches are mouse 129 C regions and switches; or the C regions and switches are mouse Black 6 C regions and switches. In another embodiment, the S gamma and C regions are mouse S gamma and C regions, and the Smu is a rat Smu.

In an example of any configuration, aspect, embodiment or example of the invention, the human VpreB gene has a nucleotide sequence that is at least 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 1. The VpreB gene encodes a human VpreB that pairs with the heavy chain human variable region. In an example, the human VpreB gene has a nucleotide sequence that is identical to SEQ ID NO:1.

In an example of any configuration, aspect, embodiment or example of the invention, the human VpreB gene encodes a human VpreB comprising an amino acid sequence that is at least 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99% identical to SEQ ID NO: 2. The VpreB gene encodes a human VpreB that pairs with the heavy chain human variable region. In an example, the human VpreB gene encodes a human VpreB comprising an amino acid sequence that is identical to SEQ ID NO:2; or optionally with from one to 15 amino acid changes (eg, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 changes) compared with SEQ ID NO: 2.

In any configuration, aspect, embodiment or example of the invention, the human VpreB gene encodes a human VpreB that pairs with the heavy chain human variable region, the human VpreB gene having from one to 25 nucleotide changes (eg, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 changes) compared to human VpreB (SEQ ID NO: 1). The VpreB gene can be manipulated in vitro by creation of a targeting vector comprising the gene sequence and homology arms flanking the sequence. This can be introduced into the genome of, for example an ES cell, using the standard technique of homologous recombination as will be apparent to the skilled person. Alternatively, recombinase mediated cassette exchange (RMCE)—another standard technique—can be used to effect precise insertion of the VpreB gene into the genome.

Homologous recombination or RMCE are generally-useful techniques for the specific, targeted, introduction of sequences into a genome. One or both of these techniques are readily applicable for introducing into a vertebrate or cell genome a human VpreB sequence (wild-type or mutant); V, D, J and C regions; switches; enhancers and other immunoglobulin control elements; and λ5 genes. These and other techniques for target insertion are known to the skilled addressee.

In an example of any configuration, aspect, embodiment or example of the invention, the genome does not comprise a non-human vertebrate (eg, mouse or rat) species VpreB1 and/or VpreB2 gene. In this embodiment, the possibility of non-human VpreB competing with the human VpreB for binding to the human variable regions is eliminated. Thus, pairing of human V regions with human VpreB on pre-B cells is not hampered by such competition, which is advantageous for promoting B-cell development.

In an example of any configuration, aspect, embodiment or example of the invention, the genome further comprises a λ5 gene (eg, a human, mouse or rat λ5); optionally wherein the λ5 gene is a λ5 of a non-human vertebrate (eg, mouse or rat) species and said constant region gene is a constant region gene of the same non-human vertebrate (eg, mouse or rat) species as the λ5 gene. For example, the constant region and λ5 genes are mouse constant region and λ5 genes, eg, mouse 129 constant region and λ5 genes; or mouse JM8 constant region and λ5 genes). For example, the constant region and λ5 genes are rat constant region and λ5 genes. In an example, constant region gene and λ5 gene are endogenous genes of said vertebrate or cell. Thus, the invention contemplates close matching of not only the V-region/VpreB part of pre-BCRs but also close matching of the Cmu/λ5 part. While not wishing to be bound to any theory, the inventors believe that this is useful for stabilising the pre-BCRs to promote B-cell development in vivo, thus improving the B-cell pool (and potentially diversity) from which to select antigen-specific antibodies following immunisation of vertebrates according to the inventions.

In another embodiment of any configuration, aspect, embodiment or example of the invention, the λ5 is a chimaeric λ5 comprising a non-human vertebrate λ5 (eg, endogenous mouse or rat λ5) in which amino acid sequences have been replaced by corresponding amino acids from a human λ5. For example, the region of λ5 that interfaces with VpreB to form the SLC is replaced with the corresponding part from a human λ5 to form a chimaeric λ5 with a human part (ie, species-matched with the human VpreB) and a non-human vertebrate (eg, mouse or rat) part that is species matched with the CH1 of the non-human vertebrate Cmu of the heavy chain transgene. In the alternative embodiment where the λ5 is a human λ5, the λ5 is advantageously species-matched to interface with the human λ5.

In one aspect the non-human vertebrate is able to generate a diversity of at least 1×10⁶ different functional chimaeric antibody sequence combinations.

In a second configuration, the method provides

A method of constructing a transgenic non-human vertebrate cell (eg, an ES cell, eg, a mouse or rat ES cell), the method comprising

(i) introducing into the genome of a non-human vertebrate cell (or an ancestor thereof) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments so that said gene segments are operably connected upstream of a constant region gene (optionally an endogenous non-human vertebrate constant region gene) to form a heavy chain transgene, wherein in said cell or a progeny thereof the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region; or

(ii) introducing into the genome of a non-human vertebrate cell (or an ancestor thereof) a rearranged VDJ encoding a human variable region so that said VDJ is operably connected upstream of a constant region gene (optionally an endogenous non-human vertebrate constant region gene) to form a heavy chain transgene, wherein in said cell or a progeny thereof the transgene encodes (optionally following VDJ combination with the constant region) an antibody heavy chain having a human variable region and a constant region;

and wherein the method further comprises introducing into the cell (before, after or concomitantly with step (i) or (ii)) a human VpreB gene capable of expressing a human VpreB.

Targeting of the genome of an ES cell to produce a transgenic cell and subsequently a transgenic vertebrate (eg, a mouse) may be carried out using a protocol as explained by reference to the FIGS. 1-18 and description in WO2011004129, the disclosure of which is incorporated herein by reference. A suitable mouse ES cell is selected from AB2.1 cells (available from Baylor College of Medicine, Texas, USA), AB2.2 cells and JM8 cells.

In any configuration, aspect, embodiment or example of the invention, the cell is an ES cell is capable of developing into a non-human mammal able to produce a repertoire of antibodies which are chimaeric, said chimaeric antibodies having a non-human mammal constant region and a human variable region. Optionally the genome of the cell is modified to prevent expression of fully host-species specific antibodies.

In one aspect the cell is an induced pluripotent stem cell (iPS cell).

In one aspect cells are isolated non-human mammalian cells.

In one aspect a cell as disclosed herein is preferably a non-human mammalian cell.

In one aspect the cell is a cell from a mouse strain selected from C57BL/6, M129 such as 129/SV, BALB/c, and any hybrid of C57BL/6, M129 such as 129/SV, or BALB/c. In an example, the mouse is a C57BL/6-129/Sv hybrid.

Maintaining the performance of the ES cell clones through multiple rounds of manipulation without the need to test the germ line potential of the ES cell line at every step may be important in the method of constructing a transgenic locus. The cell lines currently in use for the KOMP and EUCOMM global knockout projects have been modified twice prior to their use for this project and their germ line transmission rates are unchanged from the parental cells (these lines are publicly available, see www.komp.org and www.eucomm.org). This cell line, called JM8, can generate 100% ES cell-derived mice under published culture conditions (Pettitt, S. J., Liang, Q., Rairdan, X. Y., Moran, J. L., Prosser, H. M., Beier, D. R., Lloyd, K. C., Bradley, A., and Skarnes, W. C. (2009). Agouti C57BL/6N embryonic stem cells for mouse genetic resources. Nature Methods.). These cells have demonstrated ability to reproducibly contribute to somatic and germ line tissue of chimaeric animals using standard mouse ES cell culture conditions. This capability can be found with cells cultured on a standard feeder cell line (SNL) and even feeder-free, grown only on gelatine-coated tissue culture plates. One particular sub-line, JM8A3, maintained the ability to populate the germ line of chimeras after several serial rounds of sub-cloning. Extensive genetic manipulation via, for example, homologous recombination—as could be the case in the present method—cannot compromise the pluripotency of the cells. The ability to generate chimeras with such high percentage of ES cell-derived tissue has other advantages. First, high levels of chimaerism correlates with germ line transmission potential and provide a surrogate assay for germ line transmission while only taking 5 to 6 weeks. Second, since these mice are 100% ES cell derived the engineered loci can be directly tested, removing the delay caused by breeding. Testing the integrity of the new Ig loci is possible in the chimera since the host embryo will be derived from animals that are mutant for the RAG-1 gene as described in the next section.

Another cell line that may be used is an HPRT-negative cell line, such as AB2.1, as disclosed in “Chromosome engineering in mice, Ramirez-Solis R, Liu P and Bradley A, Nature 1995; 378; 6558; 720-4.

RAG-1 Complementation

While many clones will generate 100% ES derived mice some will not. Thus, at every step mice can be generated in a RAG-1-deficient background. This provides mice with 100% ES-derived B- and T-cells which can be used directly for immunization and antibody production. Cells having a RAG-2 deficient background, or a combined RAG-1/RAG-2 deficient background may be used, or equivalent mutations in which mice produce only ES cell-derived B cells and/or T cells.

In order that only the human-mouse IgH (and optionally also light chain) loci are active in these mice, the human-mouse IgH (and optionally light chain loci) can be engineered in a cell line in which one allele of the IgH (and optionally light chain) locus has already been inactivated. Alternatively the inactivation of the host Ig locus/loci can be carried out after insertion.

Mouse strains that have the RAG-1 gene mutated are immunodeficient as they have no mature B- or T-lymphocytes (U.S. Pat. No. 5,859,307). T- and B-lymphocytes only differentiate if proper V(D)J recombination occurs. Since RAG-1 is an enzyme that is crucial for this recombination, mice lacking RAG-1 are immunodeficient. If host embryos are genetically RAG-1 homozygous mutant, a chimera produced by injecting such an embryo will not be able to produce antibodies if the animal's lymphoid tissues are derived from the host embryo. However, JM8 cells and AB2.1 cells, for example, generally contribute in excess of 80% of the somatic tissues of the chimaeric animal and would therefore usually populate the lymphoid tissue. JM8 cells have wild-type RAG-1 activity and therefore antibodies produced in the chimaeric animal would be encoded by the engineered JM8 ES cell genome only. Therefore, the chimaeric animal can be challenged with an antigen by immunization and subsequently produce antibodies to that antigen. This allows one skilled in the art to test the performance of the engineered human/mouse Ig loci as described in the present invention.

One skilled in the art could use the chimaeric animal as described to determine the extent of antibody diversity (see e.g. Harlow, E. & Lane, D. 1998, 5^(th) edition, Antibodies: A Laboratory Manual, Cold Spring Harbor Lab. Press, Plainview, N.Y.). For example, the existence in the chimaeric animal's serum of certain antibody epitopes could be ascertained by binding to specific anti-idiotype antiserum, for example, in an ELISA assay. One skilled in the art could also sequence the genomes or heavy chain RNA or DNA sequences of B-cell clones derived from the chimaeric animal (naïve or immunised) and compare said sequence to wild-type sequence (ie, minus human VpreB) to assess the B-cell repertoire and ascertain the level of hypermutation, such hypermutation being indicative of normal antibody maturation.

One skilled in the art would also use said chimaeric animal to examine antibody function wherein said antibodies are encoded from the engineered Ig loci (see e.g. Harlow, E. & Lane, D. 1998, 5^(th) edition, Antibodies: A Laboratory Manual, Cold Spring Harbor Lab. Press, Plainview, N.Y.). For example, antisera could be tested for binding an antigen, said antigen used to immunize the chimaeric animal. Such a measurement could be made by an ELISA assay. Alternatively, one skilled in the art could test for neutralization of the antigen by addition of the antisera collected from the appropriately immunized chimaeric animal.

It is well known to those skilled in the art that positive outcomes for any of these tests demonstrate the ability of the engineered Ig loci in the presence of VpreB during B-cell development according to the instant invention, to encode antibodies with human variable regions and non-human vertebrate constant regions, said antibodies capable of functioning in the manner of wild-type antibodies.

The introduction of human gene segment DNA into the genome can be carried out using vectors carrying the DNA, as more fully explained below. In one aspect such vectors are BACs (bacterial artificial chromosomes). It will be appreciated that other cloning vectors may be used in the invention, and therefore reference to BACs herein may be taken to refer generally to any suitable vector. As described in WO2011004129, in one aspect the inserted DNA is built up in the genome of a cell, such as an ES cell, in a stepwise manner using 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20 or more separate insertions for the heavy chain gene segments. Fragments are suitably inserted at the same or substantially the same cell locus, e.g. ES cell locus, one after another, to form the complete VDJ region.

In a further aspect the method comprises the insertion of human heavy chain gene segments upstream of the non-human vertebrate constant region by step-wise insertion of multiple fragments by homologous recombination, preferably using an iterative process. Suitably fragments of approximately 100 KB from the human VDJ locus are inserted, suitably to form a VDJ region after the final iteration of the insertion process, as described in WO2011004129.

In one aspect the insertion process commences at a site where an initiation cassette has been inserted into the genome of a cell, such as an ES cell, providing a unique targeting region. In one aspect the initiation cassette is inserted in the non-human vertebrate heavy chain locus, for use in insertion of human heavy chain DNA. The initiation cassette suitably comprises a vector backbone sequence with which a vector having a human DNA fragment in the same backbone sequence can recombine to insert the human DNA into the cell (e.g. ES) cell genome, and suitably a selection marker, such as a negative selection marker. Suitably the vector backbone sequence is that of a BAC library, to allow BACs to be used in the construction of the ES cells and mammals. The vector backbone sequence may however be any sequence which serves as a target site into which a homologous sequence can insert, for example by homologous recombination and, for example RMCE, and is optionally not DNA encoding any of the VDJ or constant region.

Suitable BACs are available from the Sanger centre, see “A genome-wide, end-sequenced 129Sv BAC library resource for targeting vector construction”. Adams D J, Quail M A, Cox T, van der Weyden L, Gorick B D, Su Q, Chan W I, Davies R, Bonfield J K, Law F, Humphray S, Plumb B, Liu P, Rogers J, Bradley A. Genomics. 2005 December; 86(6):753-8. Epub 2005 Oct. 27. The Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK. BACs containing human DNA are also available from, for example, Invitrogen. A suitable library (source of human heavy chain gene segments) is described in Osoegawa K et al, Genome Research 2001. 11: 483-496 and obtainable from the Roswell Park Cancer Institute (USA)/Invitrogen.

In any configuration, aspect, embodiment or example of the invention, non-human vertebrates, such as mice, are generated in a RAG-1 or RAG-2-deficient background, or other suitable genetic background which prevents the production of mature host B and T lymphocytes.

In any configuration, aspect, embodiment or example of the invention, the non-human vertebrate is a rodent, suitably a mouse, and cells of the invention, are rodent cells or ES cells, suitably mouse ES cells.

The ES cells of the present invention can be used to generate animals using techniques well known in the art, which comprise injection of the ES cell into a blastocyst followed by implantation of chimaeric blastocystys into females to produce offspring which can be bred and selected for homozygous recombinants having the required insertion. In one aspect the invention relates to a chimeric non-human vertebrate comprised of ES cell-derived tissue and host embryo derived tissue, wherein the ES cell is according to the invention. In one aspect the invention relates to genetically-altered subsequent generation non-human vertebrates, which include vertebrates homozygous for the human VDJ region.

In one any configuration, aspect, example or embodiment, the genome of the cell or vertebrate comprises: one or more human Ig light chain kappa V regions and one or more human Ig light chain kappa J regions upstream of all or part of a kappa constant region.

In one any configuration, aspect, example or embodiment, the genome of the cell or vertebrate comprises: one or more human Ig light chain lambda V regions and one or more human Ig light chain lambda J regions upstream of all or part of a lambda constant region.

Suitably the light chain Viand C regions are able to form antibody light chains in vivo capable of forming an antibody with one or more of the chimaeric heavy chains to form an antibody that binds a predetermined antigen.

In such aspects a human kappa and/or lambda region is inserted into the genome, in combination with insertion of the heavy chain VDJ region or part thereof, upstream of the host heavy chain constant region as disclosed herein.

Suitably the insertion of the human VJC light chain DNA, or part thereof as disclosed above, is made at the equivalent mouse locus. In one aspect the human light chain kappa VJC DNA, or part thereof, is inserted immediately upstream or downstream of the mouse kappa VJC region. In one aspect, the human light chain lambda VJC region or part thereof is inserted immediately upstream or downstream of the mouse lambda VJC region. In one aspect only the human kappa VJC locus is inserted and not the human lambda VJC locus. In one aspect only the human lambda VJC locus is inserted and not the human kappa VJC locus. Insertions may be made using the techniques disclosed herein, and suitably do not remove the host sequences from the genome. In one aspect the non-human mammal host VJC sequences may be inactivated in some way, by mutation, or inversion, or by insertion of the human variable region DNA, or by any other means. In one aspect the cell or non-human mammal of the invention may comprise an insertion of the complete VJC human region.

The human kappa variable region DNA might be inserted into the genome in functional arrangement with a lambda constant region, for example inserted upstream of a lambda constant region. Alternatively human lambda region variable DNA might be inserted in functional arrangement with a kappa constant region, for example inserted upstream of a kappa constant region.

In one any configuration, aspect, example or embodiment, the genome of the cell or vertebrate comprises: one or more non-human vertebrate control sequences such as the enhancer sequence(s) is maintained upstream of the nonhuman vertebrate Mu constant region, suitably in its native position with respect to the distance from the constant region.

In one any configuration, aspect, example or embodiment, one or more non-human mammal control sequences such as an enhancer sequence(s) are maintained downstream of the nonhuman vertebrate Mu constant region, suitably in its native position with respect to the distance from the constant region.

In one any configuration, aspect, example or embodiment, a non-human mammal switch sequence, suitably the endogenous switch sequence, is maintained upstream of the non-human vertebrate Mu constant region, suitably in its native position with respect to distance from the constant region.

In such location the host enhancer or switch sequences are operative in vivo with the host constant region sequence(s).

In one aspect a switch sequence is neither human, nor native in the non-human mammal, for example in one aspect a non-human mammal switch sequence is not a mouse or human switch sequence. The switch sequence may be, for example, a rodent or primate sequence, or a synthetic sequence. In particular the switch sequence may be a rat sequence where the non-human mammal is a mouse. By way of example, a mouse or human constant mu sequence may be placed under the control of a switch sequence from a rat, or chimp, or other switch sequence, suitably capable of allowing isotype switching to occur in vivo.

One combination envisaged is a rat switch with mouse enhancer sequences and mouse constant regions in a mouse cell.

In one aspect the human promoter and/or other control elements that are associated with the different human V, D or J regions are maintained in after insertion of the human VDJ into the mouse genome.

The functional replacement of human promoter or other control regions by non-human mammal promoter or control regions may be carried out by use of recombineering, or other recombinant DNA technologies, to insert a part of the human Ig region (such as a human V region) into a vector (such as a BAC) containing a non-human Ig region. The recombineering/recombinant technique suitably replaces a portion of the non-human (e.g. mouse) DNA with the human Ig region, and thus places the human Ig region under control of the non-human mammal promoter or other control region. Suitably the human coding region for a human V region replaces a mouse V region coding sequence. Suitably the human coding region for a human D region replaces a mouse D region coding sequence. Suitably the human coding region for a human J region replaces a mouse J region coding sequence. In this way human V, D or J regions may be placed under the control of a non-human mammal promoter, such as a mouse promoter.

In one any configuration, aspect, example or embodiment, the human DNA inserted into the genome of the non-human vertebrate or cell are placed under control of the host regulatory sequences or other (non-human, non-host) sequences, In one aspect reference to human coding regions includes both human introns and exons, or in another aspect simply exons and no introns, which may be in the form of cDNA.

It is also possible to use recombineering, or other recombinant DNA technologies, to insert a non-human-mammal (e.g. mouse) promoter or other control region, such as a promoter for a V region or

VpreB, into a BAC containing a human Ig region. The A recombineering step then places a portion of human DNA under control of the mouse promoter or other control region.

Generally, insertion of human variable region DNA at or close to the equivalent endogenous locus in the recipient genome is preferred, for example within 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 kb of the boundary (upstream or downstream) of a host immunoglobulin locus.

In one embodiment of the method of the second configuration of the invention, said genome is homozygous for said transgene and endogenous non-human vertebrate antibody heavy chain expression is inactivated.

In one embodiment of the method of the second configuration of the invention, endogenous non-human vertebrate antibody light chain expression is inactivated.

In one embodiment of the method of the second configuration of the invention, the heavy chain transgene is devoid of a CH1 gene segment and the genome comprises no functional antibody light chain locus.

In one embodiment of the method of the second configuration of the invention, said constant region is a Cmu. For example, the Cmu is a non-human vertebrate Cmu or an endogenous Cmu of said vertebrate or cell.

In one embodiment of the method of the second configuration of the invention, said genome does not comprise a non-human vertebrate (eg, mouse or rat) species VpreB1 and/or VpreB2 gene.

In one embodiment of the method of the second configuration of the invention, the genome further comprises a λ5 gene; optionally wherein the λ5 gene is a λ5 of a non-human vertebrate (eg, mouse or rat) species and said constant region gene is a constant region gene of the same non-human vertebrate (eg, mouse or rat) species as the λ5 gene.

In one embodiment of the method of the second configuration of the invention, said constant region gene and λ5 gene are endogenous genes of said vertebrate or cell.

In one embodiment of the method of the second configuration of the invention, the method comprises making a progeny of the cell made according to the method, wherein the progeny is homozygous for said heavy chain transgene and endogenous non-human vertebrate antibody expression has been inactivated.

A third configuration of the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(a) providing in a non-human vertebrate embryonic stem (ES) cell genome an immunoglobulin transgene capable of expressing an antibody mu heavy chain, wherein the antibody heavy chain comprises a human variable region and a mu constant region (optionally an endogenous non-human vertebrate mu constant region); and creating a first non-human vertebrate from said ES cell or a progeny thereof;

(b) providing in a second ES cell genome a second transgene comprising a human VpreB gene capable of expressing a human VpreB; and creating a second non-human vertebrate from said ES cell or a progeny thereof; and

(c) creating by breeding a third non-human vertebrate capable of co-expressing the mu antibody heavy chain and human vpreB wherein a pre-B-cell receptor can form to promote B-cell development of cells bearing a mu heavy chain in said third vertebrate; the third vertebrate being made by crossing said first and second vertebrates or progeny thereof by breeding to create said third vertebrate, the third vertebrate comprising the first and second transgenes, and wherein endogenous heavy chain expression has been inactivated in said third vertebrate.

In one embodiment of the method of the third configuration of the invention, in (a) the heavy chain transgene is constructed to be devoid of a CH1 gene segment and the genome of the third non-human vertebrate comprises no functional antibody light chain locus.

In one embodiment of the method of the third configuration of the invention, the genome of the third vertebrate does not comprise a non-human vertebrate (eg, mouse or rat) species VpreB1 and/or VpreB2 gene.

In one embodiment of the method of the third configuration of the invention, the third non-human vertebrate expresses a λ5 of a non-human vertebrate (eg, mouse or rat) species and said constant region is a constant region of the same non-human vertebrate (eg, mouse or rat) species as the λ5 gene; optionally wherein the λ5 and constant region are an endogenous λ5 and constant region of mouse or rat.

The invention further provides a transgenic mouse according to the third vertebrate, or a progeny thereof.

In a fourth aspect, the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(i) inactivating endogenous heavy chain expression in said vertebrate;

(ii) providing in the genome of said vertebrate an immunoglobulin transgene capable of expressing an antibody mu heavy chain, wherein the antibody heavy chain comprises a human variable region and a non-human vertebrate mu constant region (optionally an endogenous non-human vertebrate mu constant region); and

(ii) providing in the genome of said vertebrate a second transgene capable of expressing a human VpreB wherein a pre-B-cell receptor can form to promote B-cell development of cells bearing a mu heavy chain in said vertebrate;

Optionally wherein the genome is homozygous for said first transgene.

In one embodiment of the method of the fourth configuration of the invention, the heavy chain transgene is devoid of a CH1 gene segment and the genome of the vertebrate comprises no functional antibody light chain locus.

In one embodiment of the method of the fourth configuration of the invention, the genome of the vertebrate does not comprise a non-human vertebrate (eg, mouse or rat) species VpreB1 and/or VpreB2 gene.

In one embodiment of the method of the fourth configuration of the invention, the non-human vertebrate expresses a λ5 of a non-human vertebrate (eg, mouse or rat) species and said constant region is a constant region of the same non-human vertebrate (eg, mouse or rat) species as the λ5 gene; optionally wherein the λ5 and constant region are an endogenous λ5 and constant region of mouse or rat.

The invention also provides a transgenic mouse according to the fourth configuration, or a progeny thereof.

Chimaeric Surrogate Light Chains

The extent of homology between human λ5 with mouse and rat λ5 is relatively low (about 56% amino acid identity). The inventors realised the advantage of species-matching the λ5 in the SLC in a transgenic non-human vertebrate with the species of mu constant region in the pre-BCRs in such transgenic animals. Thus, the invention further provides aspects that provide for the possibility of species-matching in vivo the variable and constant regions of the antibody heavy chain with both components (VpreB and λ5) of a chimaeric SLC. This is useful for promoting B-cell development in the transgenic non-human vertebrate (eg, a mouse or rat).

Thus, in an embodiment of the vertebrates, cells and methods of the invention, the genome further comprises a λ5 gene; optionally wherein the λ5 gene is a λ5 of a non-human vertebrate (eg, mouse or rat) species and said constant region gene is a constant region gene of the same non-human vertebrate (eg, mouse or rat) species as the λ5 gene. For example, the vertebrate is a mouse or rat (or cell is a mouse or rat cell) and the λ5 is a λ5 that is endogenous to the mouse or rat (or mouse or rat cell). In this example, the constant region of the heavy chain transgene is an constant region (eg, Cmu) endogenous to the mouse or rat. In this way, the λ5 is matched for pairing to the constant region at the cell surface in the pre-BII stage, and the human VpreB is matched for pairing with the human variable region encoded by the transgene.

To this end, the invention provides in a fifth configuration

A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse cell or rat cell) whose genome comprises an antibody heavy chain transgene,

the transgene comprising

(a) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene of a non-human vertebrate (eg, mouse or rat) species so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region; or

(b) a rearranged VDJ encoding a human variable region operably connected upstream of a constant region gene of a non-human vertebrate (eg, mouse or rat) species so that the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region;

the genome further comprising one or more genes together encoding a chimaeric surrogate light chain, the surrogate light chain comprising a human VpreB and λ5 of said non-human vertebrate (eg, mouse or rat) species.

In one embodiment of the method of the fifth configuration of the invention, said genome is homozygous for said transgene and endogenous non-human vertebrate antibody heavy chain expression has been inactivated.

In one embodiment of the method of the fifth configuration of the invention, said constant region gene is a Cmu gene.

In one embodiment of the method of the fifth configuration of the invention, said constant region gene and λ5 gene are endogenous genes of said vertebrate or cell.

In one embodiment of the method of the fifth configuration of the invention, said genome does not comprise a non-human vertebrate (eg, mouse or rat) species VpreB1 and/or VpreB2 gene.

In a sixth configuration, the invention further provides

A method of constructing a transgenic non-human vertebrate cell (eg, an ES cell, eg, a mouse or rat ES cell), the method comprising

(i) introducing into a non-human vertebrate cell (or an ancestor thereof) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments so that said gene segments are operably connected upstream of an endogenous non-human vertebrate constant region gene, wherein in said cell or a progeny thereof the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region; or

(ii) introducing into a non-human vertebrate cell (or an ancestor thereof) a rearranged VDJ encoding a human variable region so that said VDJ is operably connected upstream of an endogenous non-human vertebrate constant region gene, wherein in said cell or a progeny thereof the transgene encodes (optionally following VDJ combination with the constant region) a chimaeric antibody heavy chain having a human variable region and a non-human vertebrate constant region;

and wherein the method further comprises introducing into the cell a human VpreB gene such that the cell or a progeny thereof is capable of expressing a chimaeric surrogate light chain as well as said chimaeric antibody heavy chain, wherein the surrogate light chain comprises a human VpreB and an endogenous non-human vertebrate λ5.

In one embodiment of the method of the sixth configuration of the invention, said constant region gene is a Cmu gene.

In one embodiment of the method of the sixth configuration of the invention, each endogenous non-human vertebrate VpreB gene is inactivated or deleted from said genome.

In one embodiment, the method of the sixth configuration of the invention comprises making a progeny of the cell made according to the method, wherein the progeny is homozygous for said heavy chain transgene and endogenous non-human vertebrate antibody heavy chain expression has been inactivated.

In a seventh configuration, the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(a) providing in a non-human vertebrate embryonic stem (ES) cell an immunoglobulin transgene capable of expressing a chimaeric antibody mu heavy chain, wherein the chimaeric antibody heavy chain comprises a human variable region and an endogenous non-human vertebrate mu constant region; and creating a first non-human vertebrate from said ES cell or a progeny thereof;

(b) providing in a second ES cell a second transgene capable of expressing a human VpreB so that the human VpreB and an endogenous λ5 form a chimaeric surrogate light chain; and creating a second non-human vertebrate from said ES cell or a progeny thereof; and

(c) creating by breeding a third non-human vertebrate capable of co-expressing the chimaeric mu antibody heavy chain and chimaeric surrogate light chain wherein a pre-BCR can form to promote B-cell development in said third vertebrate; the third vertebrate being made by crossing said first and second vertebrates or progeny thereof by breeding to create said third vertebrate, the third vertebrate comprising the first and second transgenes, and wherein endogenous heavy chain expression has been inactivated in said third vertebrate.

The invention provides a transgenic mouse according to the vertebrate of this method, or a progeny thereof.

In an eighth configuration, the invention provides

A method of promoting B-cell development in a non-human vertebrate (eg, a mouse or rat), the method comprising

(i) inactivating endogenous heavy chain expression in said vertebrate;

(ii) providing in the genome of said vertebrate an immunoglobulin transgene capable of expressing a chimaeric antibody mu heavy chain, wherein the chimaeric antibody heavy chain comprises a human variable region and an endogenous non-human vertebrate mu constant region;

and

(ii) providing in the genome of said vertebrate a second transgene capable of expressing a human VpreB so that the human VpreB and an endogenous λ5 form a chimaeric surrogate light chain; so that the vertebrate is capable of co-expressing the chimaeric mu antibody heavy chain and chimaeric surrogate light chain wherein a pre-BCR can form to promote B-cell development in said vertebrate;

Optionally wherein the genome is homozygous for said first transgene.

A transgenic mouse according to the vertebrate of this method, or a progeny thereof, is provided.

In any configuration, aspect, embodiment or example of the invention, the inserted human genes may be derived from the same individual or different individuals, or be synthetic or represent human consensus sequences.

Techniques for constructing non-human vertebrates and vertebrate cells whose genomes comprise a transgene containing human V, J and D regions are well known in the art. For example, reference is made to WO2011004192, U.S. Pat. No. 7,501,552, U.S. Pat. No. 6,673,986, U.S. Pat. No. 6,130,364, WO2009/076464 and U.S. Pat. No. 6,586,251, the disclosures of which are incorporated herein by reference in their entirety.

In one embodiment in any configuration of the invention, the vertebrate is a non-human mammal and the vertebrate cell is a non-human mammalian cell. In one embodiment in any configuration of the invention, the vertebrate is a mouse, rat, rabbit, Camelid (eg, a llama, alpaca or camel) or shark; or the vertebrate cell is a mouse, rat, rabbit, Camelid (eg, a llama, alpaca or camel) or shark cell.

In one aspect the human heavy chain gene segments are inserted into the genome so that they are placed under control of the host regulatory sequences (eg, enhancers, promoters and/or switches) or other (non-human, non-host) sequences. In one aspect reference to human coding regions includes both human introns and exons, or in another aspect simply exons and no introns, which may be in the form of cDNA.

Alternatively it is possible to use recombineering, or other recombinant DNA technologies, to insert a non human-vertebrate (e.g. mouse) promoter or other control region, such as a promoter for a V region, into a BAC containing a human Ig region. The recombineering step then places a portion of human DNA under control of the mouse promoter or other control region.

The invention also relates to a cell line which is grown from or otherwise derived from cells as described herein, including an immortalised cell line. The cell line may comprise inserted human V, D or J genes as described herein, either in germline configuration or after rearrangement following in vivo maturation. The cell may be immortalised by fusion to a tumour cell to provide an antibody producing cell and cell line, or be made by direct cellular immortalisation.

In one aspect the non-human vertebrate of any configuration of the invention is able to generate a diversity of at least 1×10⁶ different functional chimaeric immunoglobulin sequence combinations.

Optionally in any configuration of the invention the constant region is endogenous to the vertebrate and optionally comprises an endogenous switch. In one embodiment, the constant region comprises a Cgamma (CY) region and/or a Smu (Sμ) switch. Switch sequences are known in the art, for example, see Nikaido et al, Nature 292: 845-848 (1981) and also WO2011004192, U.S. Pat. No. 7,501,552, U.S. Pat. No. 6,673,986, U.S. Pat. No. 6,130,364, WO2009/076464 and U.S. Pat. No. 6,586,251, eg, SEQ ID NOs: 9-24 disclosed in

U.S. Pat. No. 7,501,552. Optionally the constant region comprises an endogenous S gamma switch and/or an endogenous Smu switch. One or more endogenous switch regions can be provided, in one embodiment, by constructing a transgenic immunoglobulin locus in the vertebrate or cell genome in which at least one human V region, at least one human J region, and optionally at least one human D region, or a rearranged VDJ or VJ region, are inserted into the genome in operable connection with a constant region that is endogenous to the vertebrate or cell. For example, the human V(D)J regions or rearranged VDJ or VJ can be inserted in a cis orientation onto the same chromosome as the endogenous constant region. A trans orientation is also possible, in which the human V(D)J regions or rearranged VDJ or VJ are inserted into one chromosome of a pair (eg, the chromosome 6 pair in a mouse or the chromosome 4 in a rat) and the endogenous constant region is on the other chromosome of the pair, such that trans-switching takes place in which the human V(D)J regions or rearranged VDJ or VJ are spliced inoperable linkage to the endogenous constant region. In this way, the vertebrate can express antibodies having a chain that comprises a variable region encoded all or in part by human V(D)J or a rearranged VDJ or VJ, together with a constant region (eg, a Cgamma or Cmu) that is endogenous to the vertebrate.

Human variable regions are suitably inserted upstream of non-human vertebrate constant region, the latter comprising all of the DNA required to encode the full constant region or a sufficient portion of the constant region to allow the formation of an effective chimaeric antibody capable of specifically recognising an antigen.

In one aspect the chimaeric antibodies or antibody chains have a part of a host constant region sufficient to provide one or more effector functions seen in antibodies occurring naturally in a host vertebrate, for example that they are able interact with Fc receptors, and/or bind to complement.

Reference to a chimaeric antibody or antibody chain having a non-human vertebrate constant region herein therefore is not limited to the complete constant region but also includes chimaeric antibodies or chains which have all of the host constant region, or a part thereof sufficient to provide one or more effector functions. This also applies to non-human vertebrate mammals and cells and methods of the invention in which human variable region DNA may be inserted into the host genome such that it forms a chimaeric antibody chain with all or part of a host constant region. In one aspect the whole of a host non-human vertebrate constant region is operably linked to human variable region DNA.

The host non-human vertebrate constant region herein is optionally the endogenous host wild-type constant region located at the wild type locus, as appropriate for the heavy or light chain. For example, the human heavy chain DNA is suitably inserted on mouse chromosome 12, suitably adjacent the mouse heavy chain constant region, where the vertebrate is a mouse.

In one optional aspect where the vertebrate is a mouse, the insertion of the human DNA, such as the human VDJ region is targeted to the region between the J4 exon and the Cμ locus in the mouse genome IgH locus, and in one aspect is inserted between coordinates 114,667,090 and 114,665,190, suitably at coordinate 114,667,091, after 114,667,090. In one aspect the insertion of the human DNA, such as the human light chain kappa VJ is targeted into mouse chromosome 6 between coordinates 70,673,899 and 70,675,515, suitably at position 70,674,734, or an equivalent position in the lambda mouse locus on chromosome 16.

In one aspect the host non-human vertebrate constant region for forming the chimaeric antibody may be at a different (non endogenous) chromosomal locus. In this case the inserted human DNA, such as the human variable VDJ or VJ region(s) may then be inserted into the non-human genome at a site which is distinct from that of the naturally occurring heavy or light constant region. The native constant region may be inserted into the genome, or duplicated within the genome, at a different chromosomal locus to the native position, such that it is in a functional arrangement with the human variable region such that chimaeric antibodies of the invention can still be produced.

In one aspect the human DNA is inserted at the endogenous host wild-type constant region located at the wild type locus between the host constant region and the host VDJ region.

Reference to location of the variable region upstream of the non-human vertebrate constant region means that there is a suitable relative location of the two antibody portions, variable and constant, to allow the variable and constant regions to form a chimaeric antibody or antibody chain in vivo in the vertebrate. Thus, the inserted human DNA and host constant region are in operable connection with one another for antibody or antibody chain production.

In one aspect the inserted human DNA is capable of being expressed with different host constant regions through isotype switching. In one aspect isotype switching does not require or involve trans switching. Insertion of the human variable region DNA on the same chromosome as the relevant host constant region means that there is no need for trans-switching to produce isotype switching.

In the present invention, optionally host non-human vertebrate constant regions are maintained and it is preferred that at least one non-human vertebrate enhancer or other control sequence, such as a switch region, is maintained in functional arrangement with the non-human vertebrate constant region, such that the effect of the enhancer or other control sequence, as seen in the host vertebrate, is exerted in whole or in part in the transgenic animal. This approach is designed to allow the full diversity of the human locus to be sampled, to allow the same high expression levels that would be achieved by non-human vertebrate control sequences such as enhancers, and is such that signalling in the B-cell, for example isotype switching using switch recombination sites, would still use non-human vertebrate sequences.

A non-human vertebrate having such a genome would produce chimaeric antibodies with human variable and non-human vertebrate constant regions, but these are readily humanized, for example in a cloning step. Moreover the in vivo efficacy of these chimaeric antibodies could be assessed in these same animals.

In one aspect the inserted human IgH VDJ region comprises, in germline configuration, all of the V, D and J regions and intervening sequences from a human. Optionally, non-functional V and/or D and/or J gene segments are omitted. For example, VH which are inverted or are pseudogenes may be omitted.

In one aspect 800-1000 kb of the human IgH VDJ region is inserted into the non-human vertebrate IgH locus, and in one aspect a 940, 950 or 960 kb fragment is inserted. Suitably this includes bases 105,400,051 to 106,368,585 from human chromosome 14.

In one aspect the inserted IgH human fragment consists of bases 105,400,051 to 106,368,585 from chromosome 14. In one aspect the inserted human heavy chain DNA, such as DNA consisting of bases 105,400,051 to 106,368,585 from chromosome 14, is inserted into mouse chromosome 12 between the end of the mouse J4 region and the Eli region, suitably between co-ordinates 114,667,090 and 114,665,190, or at co-ordinate 114,667,091, after 114,667,090. In one aspect the insertion is between co-ordinates 114,667,089 and 114,667,090 (co-ordinates refer to NCBI m37, for the mouse C57BL/6J strain), or at equivalent position in another non-human vertebrate genome.

A cell or non-human vertebrate of the invention, in one embodiment, comprises an insertion of human heavy chain variable region DNA between co-ordinates 114, 666, 183 and 114, 666, 725, such as between 114 666 283 and 114 666 625, optionally between co-ordinates 114,666,335 and 114,666,536, optionally between 114,666,385 and 114,666,486, or between 114,666,425 and 114,666,446, or between 114,666,435 and 114,666,436 of mouse chromosome 12 with reference to NCBIM37 for the mouse genome, relating to mouse strain C57BL/6J or an equivalent position of mouse chromosome 12 from a different mouse strain or an equivalent position in the genome of another non-human vertebrate, e.g., a rat. The insertion between co-ordinates 114,666,435 and 114,666,436 relating to mouse strain C57BL/6J is equivalent to an insertion between co-ordinates 1207826 and 1207827 on chromosome 12 with reference to the 129/SvJ genomic sequence of the geneBank access number NT114985.2. An insertion may be made at equivalent position in another genome, such as another mouse genome. In an example of this embodiment, the cell or mammal of the invention comprises a human IgH VDJ region which comprises or consists of nucleotides 106,328,851-107,268,544, such as nucleotides 106,328,901-107,268,494, such as nucleotides 106,328,941-107,268,454, such as nucleotides 106,328,951-107,268,444 of human Chromosome 14, with reference to the GRCH37/hg19 sequence database, or insertion of equivalent nucleotides relating to chromosome 14 from a different human sequence or database. The human insertion may be made between the regions indicated above.

In one aspect there is inserted into the genome the human kappa VJ region which comprises, in germline configuration, all of the V and J regions and intervening sequences from a human. Optionally, non-functional V and/or J gene segments are omitted.

Suitably this includes bases 88,940,356 to 89,857,000 from human chromosome 2, suitably approximately 917 kb. In a further aspect the light chain VJ insert may comprise only the proximal clusters of V segments and J segments. Such an insert would be of approximately 473 kb.

In one aspect the human light chain kappa DNA, such as the human Igκ fragment of bases 88,940,356 to 89,857,000 from human chromosome 2, is suitably inserted into mouse chromosome 6 between coordinates 70,673,899 and 70,675,515, suitably at position 70,674,734.

In one aspect the human lambda V (or VJ) region is inserted into the genome, which comprises, in germline configuration, all of the V (and optionally J) regions and intervening sequences from a human (which will thus include the human VpreB gene and a promoter). Suitably this includes analogous bases to those selected for the kappa fragment, from human chromosome 2. Optionally, non-functional V and/or J gene segments are omitted.

All specific human fragments described above may vary in length, and may for example be longer or shorter than defined as above, such as 500 bases, 1 KB, 2K, 3K, 4K, 5 KB, 10 KB, 20 KB, 30 KB, 40 KB or 50 KB or more, which suitably comprise all or part of the human V(D)J region, whilst preferably retaining the requirement for the final insert to comprise human genetic material encoding the complete heavy chain region and light chain region, as appropriate, as described above.

In one aspect the 3′ end of the last inserted human sequence, generally the last human J sequence, is inserted less than 2 kb, preferably less than 1 KB from the human/non-human vertebrate (eg, human/mouse or human/rat) join region.

Optionally, the genome is homozygous at at least the IgH, or a second, or all three immunoglobulin loci (IgH, Igλ and Igκ.

In another aspect the genome may be heterozygous at one or more of the loci, such as heterozygous for DNA encoding a chimaeric antibody chain and native (host cell) antibody chain. In one aspect the genome may be heterozygous for DNA capable of encoding 2 different antibody heavy chains encoded by immunoglobulin transgenes of the invention, for example, comprising 2 different chimaeric heavy chains and a human lambda light chain.

In one aspect the invention relates to a non-human vertebrate or cell, and methods for producing said vertebrate or cell, as described herein, wherein the inserted human DNA, such as the human IgH VDJ region and/or light chain V, J regions are found on only one allele and not both alleles in the mammal or cell. In this aspect a mammal or cell has the potential to express both an endogenous host antibody heavy or light chain and a chimaeric heavy or light chain.

In one embodiment in any configuration of the invention, the genome has been modified to prevent or reduce the expression of fully-endogenous antibody. Examples of suitable techniques for doing this can be found in WO2011004192, U.S. Pat. No. 7,501,552, U.S. Pat. No. 6,673,986, U.S. Pat. No. 6,130,364, WO2009/076464, EP1399559 and U.S. Pat. No. 6,586,251, the disclosures of which are incorporated herein by reference. In one embodiment, the non-human vertebrate VDJ region of the endogenous heavy chain immunoglobulin locus, and optionally VJ region of the endogenous light chain immunoglobulin loci (lambda and/or kappa loci), have been inactivated. For example, all or part of the non-human vertebrate VDJ region is inactivated by inversion in the endogenous heavy chain immunoglobulin locus of the mammal, optionally with the inverted region being moved upstream or downstream of the endogenous Ig locus. For example, all or part of the non-human vertebrate VJ region is inactivated by inversion in the endogenous kappa chain immunoglobulin locus of the mammal, optionally with the inverted region being moved upstream or downstream of the endogenous Ig locus. For example, all or part of the non-human vertebrate VJ region is inactivated by inversion in the endogenous lambda chain immunoglobulin locus of the mammal, optionally with the inverted region being moved upstream or downstream of the endogenous Ig locus. In one embodiment the endogenous heavy chain locus is inactivated in this way as is one or both of the endogenous kappa and lambda loci.

Additionally or alternatively, the vertebrate has been generated in a genetic background which prevents the production of mature host B and T lymphocytes, optionally a RAG-1-deficient and/or RAG-2 deficient background. See U.S. Pat. No. 5,859,301 for techniques of generating RAG-1 deficient animals.

In one embodiment in any configuration of the invention, the human V, J and optional D regions are provided by all or part of the human IgH locus; optionally wherein said all or part of the IgH locus includes substantially the full human repertoire of IgH V, D and J regions and intervening sequences. A suitable part of the human IgH locus is disclosed in WO2011004192. In one embodiment, the human IgH part includes (or optionally consists of) bases 105,400,051 to 106,368,585 from human chromosome 14 (coordinates from NCB136). Additionally or alternatively, optionally wherein the vertebrate is a mouse or the cell is a mouse cell, the human V, J and optional D regions are inserted into mouse chromosome 12 at a position corresponding to a position between coordinates 114,667,091 and 114,665,190, optionally at coordinate 114,667,091 (coordinates from NCBIM37, relating to mouse strain C57BL/6J).

In one embodiment of any configuration of the vertebrate or vertebrate cell of the invention when the vertebrate is a mouse, (i) the constant region comprises a mouse or rat Sμ switch and optionally a mouse Cμ region. For example the constant region is provided by the constant region endogenous to the mouse, eg, by inserting human V(D)J region sequences into operable linkage with the endogenous constant region of a mouse genome or mouse cell genome.

In one embodiment of any configuration of the vertebrate or vertebrate cell of the invention when the vertebrate is a rat, (i) the constant region comprises a mouse or rat Sμ switch and optionally a rat Cμ region. For example the constant region is provided by the constant region endogenous to the rat, eg, by inserting human V(D)J region sequences into operable linkage with the endogenous constant region of a rat genome or rat cell genome.

In one embodiment of any configuration of the vertebrate or vertebrate cell of the invention the genome comprises an antibody light chain transgene which comprises all or part of the human Igλ locus including at least one human JA region and at least one human CA region, optionally C_(λ)6 and/or C_(λ)7. Optionally, the transgene comprises a plurality of human JA regions, optionally two or more of J_(λ)1, J_(λ)2, J_(λ6) and J_(λ)7, optionally all of J_(λ)1, J_(λ)2, and J_(λ)7. The human lambda immunoglobulin locus comprises a unique gene architecture composed of serial J-C clusters. In order to take advantage of this feature, the invention in optional aspects employs one or more such human J-C clusters inoperable linkage with the constant region in the transgene, eg, where the constant region is endogenous to the non-human vertebrate or non-human vertebrate cell. Thus, optionally the transgene comprises at least one human J_(λ)-C_(λ) cluster, optionally at least J_(λ)7-C_(λ)7. The construction of such transgenes is facilitated by being able to use all or part of the human lambda locus such that the transgene comprises one or more J-C clusters in germline configuration, advantageously also including intervening sequences between clusters and/or between adjacent J and C regions in the human locus. This preserves any regulatory elements within the intervening sequences which may be involved in VJ and/or JC recombination and which may be recognised by AID (activation-induced deaminase) or AID homologues.

Where endogenous regulatory elements are involved in CSR (class-switch recombination) in the non-human vertebrate, these can be preserved by including in the transgene a constant region that is endogenous to the non-human vertebrate. In the invention, one can match this by using an AID or AID homologue that is endogenous to the vertebrate or a functional mutant thereof. Such design elements are advantageous for maximising the enzymatic spectrum for SHM (somatic hypermutation) and/or CSR and thus for maximising the potential for antibody diversity.

Optionally, the lambda transgene comprises a human EX enhancer. Optionally, the kappa transgene comprises a human Eκ enhancer. Optionally, the heavy chain transgene comprises a heavy chain human enhancer.

In one embodiment of any configuration of the invention the constant region is endogenous to the non-human vertebrate or derived from such a constant region. For example, the vertebrate is a mouse or the cell is a mouse cell and the constant region is endogenous to the mouse. For example, the vertebrate is a rat or the cell is a rat cell and the constant region is endogenous to the rat.

In one embodiment of any configuration of the invention the heavy chain transgene comprises a plurality human IgH V regions, a plurality of human D regions and a plurality of human J regions, optionally substantially the full human repertoire of IgH V, D and J regions.

In one embodiment of any configuration of the invention, the vertebrate or cell comprises a heavy chain further transgene, the further transgene comprising at least one human IgH V region, at least one human D region and at least one human J region, optionally substantially the full human repertoire of IgH V, D and J regions.

In one embodiment of any configuration of the invention,

(i) the heavy chain transgene comprises at least one human IgH V region, at least one human J region, and optionally at least one human D region; and

(ii) the vertebrate or cell comprises a kappa transgene, the kappa transgene comprising at least one human Igκ V region and at least one human J region.

In one embodiment of any configuration of the invention,

(i) the heavy chain transgene comprises at least one human IgH V region, at least one human J region, and optionally at least one human D region; and

(ii) the vertebrate or cell comprises a lambda transgene, the lambda transgene comprising at least one human Igλ V region and at least one human J region.

In one embodiment of any configuration of the invention,

(i) the heavy chain transgene comprises substantially the full human repertoire of IgH V, D and J regions; and

(ii) the vertebrate or cell comprises substantially the full human repertoire of Igκ V and J regions and/or substantially the full human repertoire of Igλ V and J regions.

An aspect provides a B-cell, hybridoma or a stem cell, optionally an embryonic stem cell or haematopoietic stem cell, according to any configuration of the invention. In one embodiment, the cell is a BALB/c, JM8 or AB2.1 or AB2.2 embryonic stem cell (see discussion of suitable cells, and in particular JM8 and AB2.1 cells, in WO2011004192, which disclosure is incorporated herein by reference). In one aspect the ES cell is derived from the mouse BALB/c, C57BL/6N, C57BL/6J, 12955 or 129Sv strain.

An aspect provides a method of isolating an antibody or nucleotide sequence encoding said antibody, the method comprising

(a) immunising (see e.g. Harlow, E. & Lane, D. 1998, 5^(th) edition, Antibodies: A Laboratory Manual, Cold Spring Harbor Lab. Press, Plainview, N.Y.; and Pasqualini and Arap, Proceedings of the National Academy of Sciences (2004) 101:257-259) a vertebrate according to any configuration or aspect of the invention with an antigen such that the vertebrate produces antibodies; and

(b) isolating from the vertebrate an antibody that specifically binds to said antigen and/or a nucleotide sequence encoding at least the heavy and/or the light chain variable regions of said antibody;

optionally wherein the variable regions of said antibody are subsequently joined to a human constant region. Such joining can be effected by techniques readily available in the art, such as using conventional recombinant DNA and RNA technology as will be apparent to the skilled person. See e.g. Sambrook, J and Russell, D. (2001, 3'd edition) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, Plainview, N.Y.).

Suitably an immunogenic amount of the antigen is delivered. The invention also relates to a method for detecting a target antigen comprising detecting an antibody produced as above with a secondary detection agent which recognises a portion of that antibody.

Isolation of the antibody in step (b) can be carried out using conventional antibody selection techniques, eg, panning for antibodies against antigen that has been immobilised on a solid support, optionally with iterative rounds at increasing stringency, as will be readily apparent to the skilled person.

As a further optional step, after step (b) the amino acid sequence of the heavy and/or the light chain variable regions of the antibody are mutated to improve affinity for binding to said antigen.

Mutation can be generated by conventional techniques as will be readily apparent to the skilled person, eg, by error-prone PCR. Affinity can be determined by conventional techniques as will be readily apparent to the skilled person, eg, by surface plasmon resonance, eg, using Biacore™.

Additionally or alternatively, as a further optional step, after step (b) the amino acid sequence of the heavy and/or the light chain variable regions of the antibody are mutated to improve one or more biophysical characteristics of the antibody, eg, one or more of melting temperature, solution state (monomer or dimer), stability and expression (eg, in CHO or E coli).

An aspect provides an antibody produced by the method of the invention, optionally for use in medicine, eg, for treating and/or preventing a medical condition or disease in a patient, eg, a human.

An aspect provides a nucleotide sequence encoding the antibody of the invention, optionally wherein the nucleotide sequence is part of a vector. Suitable vectors will be readily apparent to the skilled person, eg, a conventional antibody expression vector comprising the nucleotide sequence together in operable linkage with one or more expression control elements.

An aspect provides a pharmaceutical composition comprising the antibody of the invention and a diluent, excipient or carrier, optionally wherein the composition is contained in an IV container (eg, and IV bag) or a container connected to an IV syringe.

An aspect provides the use of the antibody of the invention in the manufacture of a medicament for the treatment and/or prophylaxis of a disease or condition in a patient, eg a human.

In a further aspect the invention relates to a method for producing an antibody specific to a desired antigen the method comprising immunizing a transgenic non-human vertebrate as above with a predetermined antigen and recovering a chimaeric antibody (see e.g. Harlow, E. & Lane, D. 1998, 5^(th) edition, Antibodies: A Laboratory Manual, Cold Spring Harbor Lab. Press, Plainview, N.Y.; and Pasqualini and Arap, Proceedings of the National Academy of Sciences (2004) 101:257-259). Suitably an immunogenic amount of the antigen is delivered. The invention also relates to a method for detecting a target antigen comprising detecting an antibody produced as above with a secondary detection agent which recognises a portion of that antibody.

In a further aspect the invention relates to a method for producing a fully humanised antibody comprising immunizing a transgenic non-human vertebrate as above with a predetermined antigen, recovering a chimaeric antibody or cells expressing the antibody, and then replacing the non-human vertebrate constant region with a human constant region. This can be done by standard cloning techniques at the DNA level to replace the non-human vertebrate constant region with an appropriate human constant region DNA sequence—see e.g. Sambrook, J and Russell, D. (2001, 3'd edition) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, Plainview, N.Y.).

In a further aspect the invention relates to humanised antibodies and antibody chains produced according to the present invention, both in chimaeric and fully humanised form, and use of said antibodies in medicine. The invention also relates to a pharmaceutical composition comprising such an antibody and a pharmaceutically acceptable carrier or other excipient.

Antibody chains containing human sequences, such as chimaeric human-non-human antibody chains, are considered humanised herein by virtue of the presence of the human protein coding regions region. Fully humanised antibodies may be produced starting from DNA encoding a chimaeric antibody chain of the invention using standard techniques.

Methods for the generation of both monoclonal and polyclonal antibodies are well known in the art, and the present invention relates to both polyclonal and monoclonal antibodies of chimaeric or fully humanised antibodies produced in response to antigen challenge in non-human vertebrates of the present invention.

In a yet further aspect, chimaeric antibodies or antibody chains generated in the present invention may be manipulated, suitably at the DNA level, to generate molecules with antibody-like properties or structure, such as a human variable region from a heavy chain absent a constant region, for example a domain antibody; or a human variable region with any constant region from either heavy or light chain from the same or different species; or a human variable region with a non-naturally occurring constant region; or human variable region together with any other fusion partner. The invention relates to all such chimaeric antibody derivatives derived from chimaeric antibodies identified according to the present invention.

In an example, the genome of the cell or non-human vertebrate of the invention encodes an antibody comprising an antibody chain having a human heavy chain variable region upstream of a mouse light chain constant region in combination with one of:

a fully human antibody light chain;

a non-human vertebrate (e.g., mouse or rat) antibody light chain;

a chimaeric non-human vertebrate (e.g., mouse or rat)—human antibody chain; or an antibody chain having a human heavy chain variable region upstream of a non-human vertebrate (e.g., mouse or rat) light chain constant region;

In a further aspect, the invention relates to use of non-human vertebrates of the present invention in the analysis of the likely effects of drugs and vaccines in the context of a quasi-human antibody repertoire.

The invention also relates to a method for identification or validation of a drug or vaccine, the method comprising delivering the vaccine or drug to a vertebrate of the invention and monitoring one or more of: the immune response, the safety profile; the effect on disease.

The invention also relates to a kit comprising an antibody or antibody derivative as disclosed herein and either instructions for use of such antibody or a suitable laboratory reagent, such as a buffer, antibody detection reagent.

The invention also relates to a method for making an antibody, or part thereof, the method comprising providing:

(i) a nucleic acid encoding an antibody, or a part thereof, obtained according to the present invention; or

(ii) sequence information from which a nucleic acid encoding an antibody obtained according to the present invention, or part thereof, can be expressed to allow an antibody to be produced.

In an embodiment, the invention provides

The non-human vertebrate, mouse, rat, cell or method of any preceding configuration, wherein the genome comprises

(a) said antibody heavy chain transgene; and

(b) an antibody kappa light chain transgene and/or an antibody lambda chain transgene; wherein all of the V, D and J in said transgenes are human V, D and J;

wherein endogenous antibody heavy and light chain expression has been inactivated; and optionally wherein said genome is homozygous for said heavy and light chain transgenes.

In an embodiment, the the kappa and lambda chain transgenes comprise constant regions of said non-human vertebrate species capable of pairing with the constant region of the heavy chain.

In an embodiment, the heavy chain transgene comprises a substantially complete human functional VH, D and JH repertoire.

In an embodiment, the kappa chain transgene comprises a substantially complete human functional Vκ and Jκ repertoire; and the lambda chain transgene comprises a substantially complete human functional VA and JA repertoire.

In a ninth embodiment, the invention provides

A transgenic mouse or rat, or a transgenic mouse or rat cell (eg, an ES cell), whose genome comprises

(a) an antibody heavy chain transgene, the transgene comprising a substantially complete human functional VH, D and JH repertoire operably connected upstream of an endogenous (mouse or rat) mu constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and an endogenous mu constant region;

(b) an antibody kappa light chain transgene and/or an antibody lambda chain transgene;

-   -   wherein all of the V, D and J in said transgenes are human V, D         and J;     -   wherein the kappa chain transgene comprises a substantially         complete human functional Vκ and Jκ repertoire; and the lambda         chain transgene comprises a substantially complete human         functional VA and JA repertoire;     -   wherein the kappa and/or lambda chain transgenes comprise         endogenous constant regions capable of pairing with the constant         region of the heavy chain;

(c) a human VpreB gene capable of expressing a human VpreB;

-   -   optionally wherein the human VpreB gene is operably linked to an         endogenous VpreB promoter; and

(d) an endogenous λ5 gene capable of expressing an endogenous λ5;

-   -   wherein endogenous antibody heavy and light chain expression has         been inactivated; and     -   optionally wherein said genome is homozygous for said heavy and         light chain transgenes.

In a tenth embodiment, the invention provides

A transgenic mouse or rat, or a transgenic mouse or rat cell (eg, an ES cell), whose genome comprises

(a) an antibody heavy chain transgene, the transgene comprising a substantially complete human functional VH, D and JH repertoire operably connected upstream of an endogenous (mouse or rat) mu constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and an endogenous mu constant region;

-   -   wherein all of the V, D and J in said transgene are human V, D         and J;     -   wherein the heavy chain transgene is devoid of a CH1 gene         segment and the genome of the vertebrate comprises no functional         antibody light chain locus;     -   wherein endogenous antibody heavy chain expression has been         inactivated; and     -   optionally wherein said genome is homozygous for said heavy         chain transgene;

(c) a human VpreB gene capable of expressing a human VpreB;

-   -   optionally wherein the human VpreB gene is operably linked to an         endogenous VpreB promoter; and

(d) an endogenous λ5 gene capable of expressing an endogenous λ5.

Insertion of Human VpreB Gene

As described above, precise insertion of exogenous DNA (such as human VpreB or antibody gene segment DNA) can be effected by homolgous recombination, RMCE or other techniques that will be apparent to the skilled person. Vector manipulation can be effected using recombineering as is well known.

The human VpreB gene is found in human cells within the human antibody VA gene segment cluster (in the nucleotide region between human Igλ V gene segments IV-53 and 5-52 (the nucleotide region being positions 22599200 to 22599926 on human chromosome 22)). Thus, in one embodiment, a nucleotide sequence found between human Igλ V gene segments IV-53 and 5-52, eg, positions 22599200 to 22599926 on human chromosome 22, is inserted into the genome of the cell or vertebrate, wherein the nucleotide sequence comprises a human VpreB gene and optionally the associated human promoter. For example, the nucleotide sequence is inserted in the endogenous lambda antibody locus upstream of the endogenous lambda constant region. In one embodiment, a region of the human VA gene segment cluster comprising at least one (or substantially all) human VA gene segments and the human VpreB and promoter is inserted upstream of the endogenous (eg, mouse or rat) CA. In one example, the insertion is between the most 3′ endogenous JA and the endogenous CA. In another embodiment, the insertion is in the endogenous VA cluster, optionally replacing the VA in whole or in part. In one embodiment, all of the human lambda VA gene segment cluster (including introns) is inserted into the genome so that it replaces the enogenous VA gene segment cluster. In one embodiment, the cell or vertebrate is a mouse or rat, so “endogenous” refers to the regions found in the genome of a mouse or rat.

Expression of the λ5 and VpreB genes is B-cell lineage restricted and is also subject to stage-specific regulation during B-cell development. Binding sites for the transcription factors EBF, E47, Pax-5 and Ikaros are present in the the λ5 and VpreB promoters. The mouse VpreB1 and λ5 genes are both located in the λ5—VpreB1 locus and 4 kb away from each other. A locus control region (LCR) located within the λ5—VpreB1 locus provides the regulatory elements for these genes are. Hence, the region around the two genes are important for the transcription of the VpreB1 and λ5 genes. Furthermore, these two genes are subject to tight stage-specific control during B-cell development. For activation at the pro-B cell stage, early B cell factor (EBF) initiates the activation of the λ5—VpreB1 locus. It has been observed that the EBF with some other factors (E2As) bind the promoters of the two genes. The transcription of these two genes has to be silenced during the transition from the pre-B to the immature B cell stage, in order for light chain rearrangement and binding with heavy chain. Ikaros gene acts as a repressor by competing with EBF for binding to the same sites in the promoters of the VpreB1 and λ5 genes. In summary, the λ5—VpreB1 locus seems to be important for the temporally-controlled activation and inactivation of the surrogate light chain during B-cell development. See:—

-   -   Mol Cell Biol. 1999 January; 19(1):671-9; Analysis of mice with         single and multiple copies of transgenes reveals a novel         arrangement for the lambda5-VpreB1 locus control region;         Sabbattini P, Georgiou A, Sinclair C, Dillon N.     -   Seminars in Immunology, 2005 April; 17(2):121-7; The         lambda5-VpreB1 locus—a model system for studying gene regulation         during early B cell development; Sabbattini P, Dillon N.

Thus, in one embodiment the invention provides a non-human vertebrate, mouse, rat, cell or method according to any configuration, aspect, embodiment or example, wherein the expression of the human VpreB gene is under endogenous control. For example, the vertebrate is a mouse or rat or the cell is a mouse or rat cell (eg, an ES cell) and the human VpreB is under the control of the gene expression control elements of the mouse or rat (cell). This can be achieved, in one example, by replacing an endogenous VpreB (eg, VpreB1) gene with a human VpreB gene at the location where the former usually resides in the genome. Precise gene replacement in genomes can be carried out by homolgous recombination or RMCE as herein described.

Mouse VpreB1 and VpreB2 are found at the following position in a mouse genome:—

Mouse VpreB1: chromosome 16: 16868494 to 16869348

Mouse VpreB2: chromosome 16: 17980658 to Ser. No. 17/981,173

Thus, the invention provides a non-human vertebrate, mouse, rat, cell or method according to any configuration, aspect, embodiment or example, wherein the human VpreB gene is operably linked to an endogenous promoter, optionally an endogenous VpreB promoter (eg, a VpreB1 or VpreB2 promoter). Thus, where the vertebrate (or cell) is a mouse or rat (cell), the human VpreB is operably linked to a VpreB1 or VpreB2 promoter of said mouse or rat (cell). Operable linkage enables the promoter to control expression of the human gene. It is desirable to use endogenous gene expression control (eg, using an endogenous promoter) for human VpreB expression since this will harness the endogenous (eg, mouse or rat) temporal expression control for turning on and off the human VpreB gene (and coordinating this with λ5 expression) during the various stages of B-cell development, particularly up to the point where light chain pairing with heavy chains occurs in immature B-cells. Thereafter, the usual switching off of SLC-component gene expression can be controlled properly by the endogenous control mechanisms when the pre-BCR is no longer participating in B-cell maturation.

Thus, the invention provides a non-human vertebrate, mouse, rat, cell or method according to any configuration, aspect, embodiment or example, wherein the human VpreB gene is operably linked to an endogenous VpreB-λ5 locus control region (LCR). Thus, where the vertebrate (or cell) is a mouse or rat (cell), the human VpreB is operably linked to a VpreB-λ5 LCR of said mouse or rat (cell). LCRs are dominant activating sequences that are able to activate gene expression at any location in the genome. Optionally, the LCR is present at the wild-type position in the genome of the vertebrate or cell. In another example, the LCR is not at the wild-type position in the genome of the vertebrate or cell. In this case, therefore, the human VpreB is positioned away from the endogenous VpreB gene in the genome, which may be advantageous for controlling the human VpreB expression independently of the endogenous λ5.

In an embodiment, the invention provides a non-human vertebrate, mouse, rat, cell or method according to any configuration, aspect, embodiment or example, wherein the human VpreB gene is operably linked to an endogenous VpreB locus control region (LCR). Thus, where the vertebrate (or cell) is a mouse or rat (cell), the human VpreB is operably linked to a VpreB LCR of said mouse or rat (cell). Optionally, the LCR is present at the wild-type position in the genome of the vertebrate or cell. In another example, the LCR is not at the wild-type position in the genome of the vertebrate or cell. In this case, therefore, the human VpreB is positioned away from the endogenous VpreB gene in the genome, which may be advantageous for controlling the human VpreB expression independently of the endogenous λ5.

Optionally one or more DNase I hypersensitive sites (HS) is operably linked in the genome to the human VpreB gene. For example, the HS forms part of the LCR or is 3′ of the LCR or 3′ of the LCR-human VpreB (see, Sabbattini et al 1999, supra).

Optionally in any configuration, aspect, embodiment or example of the inventions, the human VpreB gene is present in multiple copies in the genome, eg, the genome comprises 2, 3, 4, 5, 6, 7, 8, 9 or 10 copies of the human VpreB gene. For example, 2, 3, 4 or 5 copies are operably linked to the control element (eg, promoter and/or LCR) mentioned above.

Assessing the Development of the B-Cell Repertoire

Some assays can be used to investigate if the invention leads to promotion of B-cell development and repertoire. Although described in the context of transgenic mice below, these comments apply equally to other transgenic non-human vertebrates of the invention.

1. In chimaeric mouse, after introducing the human DNA fragment covering several human Vs, all Ds and Js between mouse Js and mouse Cs, the usage of the inserted human Vs can be tested by sequencing the mRNA products of the IgH locus in chimaeric mouse. In this case, human Vs will compete with endogenous mouse Vs for formation of the heavy chain. The usage of human Vs vs mouse Vs will be compared in mice with and without introduction of human VpreB gene into the chimaeric mouse with/without the endogenous mouse VpreB gene. More detail is provided in the non-limiting example below.

2. The use of human VpreB or mouse VpreB protein for pre-BCR assembly can be analysed in pre-B cell populations from mice with or without the human VpreB. Surrogate light chain can only be detected in the transitional pre-BI and large pre-BII stages in bone marrow. To enrich the cell populations in bone marrow for investigating pre-BCR, the deregulation of surrogate light chain can be blocked by inactivation of SLP65 gene (Nature Immunol. 2003 4, 38-43; The adaptor protein SLP-65 acts as a tumor suppressor that limits pre-B cell expansion. Flemming, A., Brummer, T., Reth, M. & Jumaa, H.) or both IRF4 (interferon-regulatory factor 4) and IRF8 genes (Genes Dev. 2003 17, 1703-1708; IRF-4,8 orchestrate the pre-B-to-B transition in lymphocyte development. Lu, R., Medina, K. L., Lancki, D. W. & Singh, H). It has been reported that pre-BII cells from mice that are deficient in these factors continue to express surrogate-light-chain genes and therefore retain pre-BCR expression. The unlimited supply of surrogate light chains lead to a hyperplastic pre-Ill-cell phenotype in vivo and provide large number of pre-B cells for us to analyze.

3. In instances (eg, as described in part 1 above), where mouse Vs are retained following insertion of human gene segments, the mouse endogenous heavy chain VDJ is inactivated, and this may be done while leaving mouse Vs intact. Thus, for the formation of chimaeric heavy chain genes, the heavy chain V regions should only be transcribed from human heavy VDJs. To assess the relative use of human v mouse Vs for heavy chain gene formation, the B-cell repertoire and diversity will be compared between mice having human VpreB gene v mice lacking the human VpreB (eg, mice with mouse VpreB only). Pairing of human Vs with human VpreB will be preferable to pairing of mouse Vs with human VpreB, and thus antibody heavy chains comprising human variable regions will be selected during B-cell development.

As cells transit from the pre-BI to pre-BII stage, the pre-BCR induces a signal for proliferative expansion. Pre-BCR controls the number of developing pre-B cells by controlling proliferation, which represents clonal expansion. Pre-BII cells enter two to seven rounds of cell division when Igμ chain assembles into pre-BCRs. This ultimately results in a more diverse antibody repertoire, as later in development, each cell will recombine and express a unique L chain. In this process, the association of surrogate light chain with μ chain monitors the cells to determine whether it has successfully completed VDJH recombination and expresses a functional μH chain which will form the pre-PCR for downstream signalling. In this case, the strength of the pairing between the μH and SL chains may determine the amplitude of the transmitted signal, and thereby, the extent of proliferation (J. Immunol. 2006, 177 pp. 2242-2249; Pre-B cell receptor assesses the quality of IgH chains and tunes the Pre-B cell repertoire by delivering differential signals. Y. Kawano, S. Yoshikawa, Y. Minegishi and H. Karasuyama). Therefore, in the mouse with chimaeric heavy chain (human VDJs and mouse Cs)—by introducing the human VpreB gene to provide complete species-matching with the chimaeric heavy chain, It is expected that the repertoire and so the diversity of chimaeric antibodies will be improved by using human VpreB instead of mouse VpreB for association with the heavy chains. The improvement for B cell repertoire and diversity can be showed by increased usage of human heavy VDJs and/or a shift in VH gene family usage in mouse expressing human VpreB (see the examples below for possible techniques).

The invention provides the following aspects:—

-   1. A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse     cell or rat cell) whose genome comprises an antibody heavy chain     transgene,     -   the transgene comprising     -   (a) one or more human VH gene segments, one or more human D gene         segments and one or more human JH gene segments operably         connected upstream of a constant region gene so that the         transgene is capable of undergoing VDJ recombination in vivo to         produce an antibody gene comprising a rearranged VDJC encoding         an antibody heavy chain having a human variable region and a         constant region; or     -   (b) a rearranged VDJ encoding a human variable region operably         connected upstream of a constant region gene so that the         transgene encodes (optionally following VDJ combination with the         constant region) an antibody heavy chain having a human variable         region and a constant region;     -   the genome further comprising     -   (i) a human VpreB gene capable of expressing a human VpreB, and     -   (ii) a non-human vertebrate (eg, mouse or rat) λ5 gene;     -   wherein the vertebrate or cell is capable of expressing a         chimaeric surrogate light chain comprising human VpreB and         non-human vertebrate λ5 for pairing with the heavy chain. -   2. The vertebrate or cell of aspect 1, wherein the constant region     is a non-human vertebrate (eg, mouse or rat) constant region so that     the chimaeric surrogate light chain is species- or strain-matched     with the heavy chain. -   3. A mouse or mouse cell according to aspect 1 or 2, wherein the     constant region and the λ5 gene are mouse constant region and mouse     λ5 gene, optionally of the same mouse strain. -   4. The vertebrate or cell of aspect 2 or 3, wherein the strain is a     129-derived, mouse black 6-derived or JM8-derived strain (eg, a     C57BL/6—129/Sv hybrid strain). -   5. The vertebrate or cell of aspect 2 or 3, wherein the constant     region is a mouse 129 or mouse

Black 6 constant region and the λ5 gene is a mouse 129 or mouse Black 6 λ5 gene.

-   6. The vertebrate or cell of any preceding aspect, wherein said     constant region gene and λ5 gene are endogenous genes of said     vertebrate or cell. -   7. The vertebrate or cell of any preceding aspect, wherein said     genome is homozygous for said transgene, human VpreB gene and     non-human vertebrate λ5 gene. -   8. The vertebrate or cell of any preceding aspect, wherein     endogenous non-human vertebrate antibody heavy chain expression has     been inactivated. -   9. The vertebrate or cell of any preceding aspect, wherein     endogenous non-human vertebrate antibody light chain expression has     been inactivated. -   10. The vertebrate or cell of any preceding aspect, wherein the     heavy chain transgene is devoid of a

CH1 gene segment and the genome comprises no functional antibody light chain locus.

-   11. The vertebrate or cell of any preceding aspect, wherein the     heavy chain transgene is devoid of a gamma CH1 gene segment and the     genome comprises no functional antibody light chain locus. -   12. The vertebrate or cell of any preceding aspect, wherein the     heavy chain transgene is devoid of a mu CH1 gene segment and the     genome comprises no functional antibody light chain locus. -   13. The vertebrate or cell of any preceding aspect, wherein said     constant region is a mu constant region, optionally endogenous mu     constant region. -   14. The vertebrate or cell of any preceding aspect, wherein the     human VpreB gene has a nucleotide sequence that is at least 85%     identical to SEQ ID NO: 1. -   15. The vertebrate or cell of any preceding aspect, wherein said     genome does not comprise a non-human vertebrate (eg, mouse or rat)     species VpreB1 and/or VpreB2 gene. -   16. The cell of any preceding aspect, wherein the cell is an ES     cell, an iPS cell, a hybridoma, an immortalised cell or a B-cell     (eg, an immortalised B-cell). -   17. The cell of aspect 16, wherein the cell is derived from C57BL/6,     M129 (eg, 129/SV), BALB/c or a hybrid of C57BL/6, M129 or BALB/c     (eg, a C57BL/6—129/Sv hybrid). -   18. The vertebrate or cell of any preceding aspect, wherein the     genome comprises an insertion of a human lambda V (or VJ) region     comprising all of the V (and optionally also J) regions and     intervening sequences, wherein the lambda V (or VJ) region comprises     a human VpreB gene. -   19. The vertebrate or cell of aspect 18, wherein non-functional V     and/or J gene segments are omitted. -   20. The vertebrate or cell of aspect 18 or 19, wherein the lambda V     (or VJ) region comprises a human VpreB gene and its associated human     promoter. -   21. The vertebrate or cell of any preceding aspect, wherein the     genome comprises an insertion of DNA corresponding to positions     22599200 to 22599926 on human chromosome 22. -   22. The vertebrate or cell of any one of aspects 18 to 21, wherein     the insertion is an insertion into an antibody light chain locus;     optionally an insertion into endogenous lambda locus upstream of the     endogenous lambda constant region or an insertion into endogenous     kappa locus upstream of the endogenous kappa constant region. -   23. The vertebrate or cell of any one of aspects 18 to 22, wherein     the insertion replaces the endogenous VA in whole or in part. -   24. The vertebrate or cell of any preceding aspect, wherein the     human VpreB gene is not within the endogenous (non-human vertebrate     or cell) VpreB-λ5 locus. -   25. The vertebrate or cell of any preceding aspect, wherein the     expression of the human VpreB gene is under endogenous (non-human     vertebrate or cell) control. -   26. The vertebrate or cell of any preceding aspect, wherein the     human VpreB gene is operably linked to one or more DNase I     hypersensitive sites. -   27. The vertebrate or cell of any preceding aspect, wherein the     human VpreB gene is present in the genome in 2, 3, 4, 5, 6, 7, 8, 9     or 10 copies. -   28. The vertebrate or cell of any preceding aspect, which is a     rodent, mouse or rat; or a rodent, mouse or rat cell. -   29. The vertebrate or cell of any preceding aspect, wherein all λ5     sequences in the genome are non-human vertebrate λ5 sequences. -   30. The vertebrate or cell of any preceding aspect, wherein the     genome is devoid of a human λ5 nucleotide sequence. -   31. A non-human vertebrate (eg, a mouse or rat) or cell (eg, a mouse     cell or rat cell) whose genome comprises an antibody heavy chain     transgene for producing heavy chains that are devoid of a CH1,     -   the transgene comprising     -   (a) one or more human VH gene segments, one or more human D gene         segments and one or more human JH gene segments operably         connected upstream of a constant region gene so that the         transgene is capable of undergoing VDJ recombination in vivo to         produce an antibody gene comprising a rearranged VDJC encoding         an antibody heavy chain having a human variable region and a         constant region; or     -   (b) a rearranged VDJ encoding a human variable region operably         connected upstream of a constant region gene so that the         transgene encodes (optionally following VDJ combination with the         constant region) an antibody heavy chain having a human variable         region and a constant region;     -   the genome further comprising a human VpreB gene capable of         expressing a human VpreB;     -   wherein the constant region is devoid of a functional CH1 gene;         and     -   wherein the vertebrate or cell is capable of expressing a human         VpreB for pairing with the heavy chains devoid of CH1. -   32. The vertebrate or cell of aspect 31, wherein the human VpreB     gene does not comprise a λ5 sequence or constant region sequence. -   33. The vertebrate or cell of aspect 31 or 32, wherein the constant     region is a human constant region. -   34. The vertebrate or cell of aspect 31 or 32, wherein the constant     region is a non-human vertebrate (eg, mouse or rat) constant region;     optionally of a strain as recited in aspect 4. -   35. The vertebrate or cell of any one of aspects 31 to 34, wherein     said genome is homozygous for said transgene and human VpreB gene. -   36. The vertebrate or cell of any one of aspects 31 to 35, wherein     endogenous non-human vertebrate antibody heavy chain expression has     been inactivated. -   37. The vertebrate or cell of any one of aspects 31 to 36, wherein     endogenous non-human vertebrate antibody light chain expression has     been inactivated. -   38. The vertebrate or cell of any one of aspects 31 to 37, wherein     the genome comprises no functional antibody light chain locus. -   39. The vertebrate or cell of any one of aspects 31 to 38, wherein     the heavy chain transgene is devoid of a gamma CH1 gene segment and     the genome comprises no functional antibody light chain locus. -   40. The vertebrate or cell of any one of aspects 31 to 39, wherein     the heavy chain transgene is devoid of a mu CH1 gene segment and the     genome comprises no functional antibody light chain locus. -   41. The vertebrate or cell of any aspects 31 to 40, wherein said     constant region is a mu constant region (optionally endogenous mu     constant region) or a gamma constant region (optionally endogenous     gamma constant region). -   42. The vertebrate or cell of any one of aspects 31 to 41, wherein     the human VpreB gene has a nucleotide sequence that is at least 85%     identical to SEQ ID NO: 1. -   43. The vertebrate or cell of any one of aspects 31 to 42, wherein     said genome does not comprise a non-human species VpreB gene. -   44. The cell of any one of aspects 31 to 43, wherein the cell is an     ES cell, an iPS cell, a hybridoma, an immortalised cell or a B-cell     (eg, an immortalised B-cell). -   45. The cell of aspect 44, wherein the cell is derived from C57BL/6,     M129 (eg, 129/SV), BALB/c or a hybrid of C57BL/6, M129 or BALB/c     (eg, a C57BL/6—129/Sv hybrid). -   46. The vertebrate or cell of any one of aspects 31 to 45, wherein     the genome comprises an insertion of a human lambda V (or VJ) region     comprising all of the V (and optionally also J) regions and     intervening sequences, wherein the lambda V (or VJ) region comprises     a human VpreB gene. -   47. The vertebrate or cell of aspect 46, wherein non-functional V     and/or J gene segments are omitted. -   48. The vertebrate or cell of aspect 46 or 47, wherein the lambda V     (or VJ) region comprises a human VpreB gene and its associated human     promoter. -   49. The vertebrate or cell of any one of aspects 31 to 48, wherein     the genome comprises an insertion of DNA corresponding to positions     22599200 to 22599926 on human chromosome 22. -   50. The vertebrate or cell of any one of aspects 46 to 49, wherein     the insertion is an insertion into an antibody light chain locus;     optionally an endogenous lambda locus upstream of the endogenous     lambda constant region. -   51. The vertebrate or cell of any one of aspects 46 to 50, wherein     the insertion replaces the endogenous VA in whole or in part. -   52. The vertebrate or cell of any one of aspects 31 to 51, wherein     the human VpreB gene is not within the endogenous (non-human     vertebrate or cell) VpreB-λ5 locus. -   53. The vertebrate or cell of any one of aspects 31 to 52, wherein     the expression of the human VpreB gene is under endogenous     (non-human vertebrate or cell) control. -   54. The vertebrate or cell of any one of aspects 31 to 53, wherein     the human VpreB gene is operably linked to one or more DNase I     hypersensitive sites. -   55. The vertebrate or cell of any one of aspects 31 to 54, wherein     the human VpreB gene is present in the genome in 2, 3, 4, 5, 6, 7,     8, 9 or 10 copies. -   56. The vertebrate or cell of any one of aspects 31 to 55, which is     a rodent, mouse or rat; or a rodent, mouse or rat cell. -   57. The vertebrate or cell of any one of aspects 31 to 56, wherein     the genome is devoid of a human λ5 gene. -   58. The vertebrate or cell of any one of aspects 31 to 57, wherein     the genome is devoid of a λ5 gene. -   59. A method of constructing a transgenic non-human vertebrate cell     (eg, an ES cell, eg, a mouse or rat ES cell), the method comprising     -   (i) introducing into the genome of a non-human vertebrate cell         (or an ancestor thereof) one or more human VH gene segments, one         or more human D gene segments and one or more human JH gene         segments so that said gene segments are operably connected         upstream of a constant region gene (optionally an endogenous         non-human vertebrate constant region gene) to form a heavy chain         transgene, wherein in said cell or a progeny thereof the         transgene is capable of undergoing VDJ recombination in vivo to         produce an antibody gene comprising a rearranged VDJC encoding         an antibody heavy chain having a human variable region and a         constant region; or     -   (ii) introducing into the genome of a non-human vertebrate cell         (or an ancestor thereof) a rearranged VDJ encoding a human         variable region so that said VDJ is operably connected upstream         of a constant region gene (optionally an endogenous non-human         vertebrate constant region gene) to form a heavy chain         transgene, wherein in said cell or a progeny thereof the         transgene encodes (optionally following VDJ combination with the         constant region) an antibody heavy chain having a human variable         region and a constant region;     -   and wherein the method further comprises introducing into the         cell a human VpreB gene in the absence of a λ5 sequence, wherein         the cell (or a progeny cell or vertebrate) is capable of         expressing human VpreB for pairing with the human variable         region of the heavy chain. -   60. The method of aspect 59, wherein the genome of the product cell     of the method is as recited in any one of aspects 1 to 58. -   61. The method of aspect 59 or 60, further comprising making a     progeny (progeny cell or vertebrate) of the product cell made     according to aspect 59 or 60, wherein the progeny is homozygous for     said heavy chain transgene and endogenous non-human vertebrate     antibody expression has been inactivated. -   62. The method of aspect 61, wherein the progeny cell is an ES cell,     an iPS cell, a hybridoma, an immortalised cell or a B-cell (eg, an     immortalised B-cell); optionally a cell derived from a strain or     hybrid as recited in aspect 17. -   63. The method of aspect 61, wherein the progeny vertebrate is a     rodent, mouse or rat. -   64. A method of promoting B-cell development in a non-human     vertebrate (eg, a mouse or rat), the method comprising     -   (a) providing in a non-human vertebrate embryonic stem (ES) cell         genome an immunoglobulin transgene capable of expressing an         antibody mu heavy chain, wherein the antibody heavy chain         comprises a human variable region and a mu constant region         (optionally an endogenous non-human vertebrate mu constant         region); and creating a first non-human vertebrate from said ES         cell or a progeny thereof;     -   (b) providing in a second ES cell genome a second transgene         comprising a human VpreB gene capable of expressing a human         VpreB; and creating a second non-human vertebrate from said ES         cell or a progeny thereof; and     -   (c) creating by breeding a third non-human vertebrate capable of         co-expressing the mu antibody heavy chain and human VpreB         wherein a pre-B-cell receptor can form to promote B-cell         development of cells bearing a mu heavy chain in said third         vertebrate; the third vertebrate being made by crossing said         first and second vertebrates or progeny thereof by breeding to         create said third vertebrate, the third vertebrate comprising         the first and second transgenes, and wherein endogenous heavy         chain expression has been inactivated in said third vertebrate. -   65. The method of aspect 64, wherein in (a) the heavy chain     transgene is constructed to be devoid of a CH1 gene segment and the     genome of the third non-human vertebrate comprises no functional     antibody light chain locus. -   66. The method of aspect 64 or 65, wherein the genome of the third     non-human vertebrate is as recited in any one of aspects 1 to 58. -   67. The method of any one of aspects 64 to 65, wherein the genome of     the third vertebrate does not comprise a non-human vertebrate (eg,     mouse or rat) species VpreB1 and/or VpreB2 gene. -   68. The method of any one of aspects 64 to 67, wherein the third     non-human vertebrate expresses a λ5 of a non-human vertebrate (eg,     mouse or rat) species and said constant region is a constant region     of the same non-human vertebrate (eg, mouse or rat) species as the     λ5 gene; optionally wherein the λ5 and constant region are an     endogenous λ5 and constant region of mouse or rat. -   69. A transgenic mouse or rat according to the third vertebrate of     any one of aspects 64 to 68, or a progeny thereof. -   70. A method of promoting B-cell development in a non-human     vertebrate (eg, a mouse or rat), the method comprising     -   (i) inactivating endogenous heavy chain expression in said         vertebrate;     -   (ii) providing in the genome of said vertebrate an         immunoglobulin transgene capable of expressing an antibody mu         heavy chain, wherein the antibody heavy chain comprises a human         variable region and a non-human vertebrate mu constant region         (optionally an endogenous non-human vertebrate mu constant         region); and     -   (ii) providing in the genome of said vertebrate a second         transgene capable of expressing a human VpreB wherein a         pre-B-cell receptor can form to promote B-cell development of         cells bearing a mu heavy chain in said vertebrate;

Optionally wherein the genome is homozygous for said first transgene.

-   71. The method of aspect 70, wherein in (a) the heavy chain     transgene is constructed to be devoid of a CH1 gene segment and the     genome of the third non-human vertebrate comprises no functional     antibody light chain locus. -   72. The method of aspect 70 or 71, wherein the genome of the     vertebrate is as recited in any one of aspects 1 to 58. -   73. The method of any one of aspects 70 to 72, wherein the genome of     the vertebrate does not comprise a non-human vertebrate (eg, mouse     or rat) species VpreB1 and/or VpreB2 gene. -   74. The method of any one of aspects 70 to 73, wherein the third     vertebrate expresses a λ5 of a non-human vertebrate (eg, mouse or     rat) species and said constant region is a constant region of the     same non-human vertebrate (eg, mouse or rat) species as the λ5 gene;     optionally wherein the λ5 and constant region are an endogenous λ5     and constant region of mouse or rat. -   75. A transgenic mouse or rat obtained or obtainable by the method     of any one of aspects 70 to 74, or a progeny thereof. -   76. A non-human vertebrate, mouse, rat, cell or method according to     any preceding aspect, wherein the expression of the human VpreB gene     is under edogenous control. -   77. The non-human vertebrate, mouse, rat, cell or method according     to aspect 76, wherein the human VpreB gene is operably linked to an     endogenous promoter, optionally an endogenous VpreB promoter (eg, a     VpreB1 promoter). -   78. The non-human vertebrate, mouse, rat, cell or method of any     preceding aspect, wherein the genome comprises     -   (a) said antibody heavy chain transgene; and     -   (b) an antibody kappa light chain transgene and/or an antibody         lambda chain transgene;     -   wherein all of the V, D and J in said transgenes are human V, D         and J;     -   wherein endogenous antibody heavy and light chain expression has         been inactivated; and     -   optionally wherein said genome is homozygous for said heavy and         light chain transgenes. -   79. The non-human vertebrate, mouse, rat, cell or method of aspect     78, wherein the kappa and lambda chain transgenes comprise constant     regions of said non-human vertebrate species capable of pairing with     the constant region of the heavy chain. -   80. The non-human vertebrate, mouse, rat, cell or method of aspect     78 or 79, wherein the heavy chain transgene comprises a     substantially complete human functional VH, D and JH repertoire. -   81. The non-human vertebrate, mouse, rat, cell or method of aspect     78, 79 or 80, wherein the kappa chain transgene comprises a     substantially complete human functional Vκ and Jκ repertoire; and     the lambda chain transgene comprises a substantially complete human     functional VA and JA repertoire. -   82. A transgenic mouse or rat, or a transgenic mouse or rat cell     (eg, an ES cell), whose genome comprises     -   (a) an antibody heavy chain transgene, the transgene comprising         a substantially complete human functional VH, D and JH         repertoire operably connected upstream of an endogenous (mouse         or rat) mu constant region gene so that the transgene is capable         of undergoing VDJ recombination in vivo to produce an antibody         gene comprising a rearranged VDJC encoding an antibody heavy         chain having a human variable region and an endogenous mu         constant region;     -   (b) an antibody kappa light chain transgene and/or an antibody         lambda chain transgene;         -   wherein all of the V, D and J in said transgenes are human             V, D and J;         -   wherein the kappa chain transgene comprises a substantially             complete human functional Vκ and Jκ repertoire; and the             lambda chain transgene comprises a substantially complete             human functional VA and JA repertoire;         -   wherein the kappa and/or lambda chain transgenes comprise             endogenous constant regions capable of pairing with the             constant region of the heavy chain;     -   (c) a human VpreB gene capable of expressing a human VpreB;         -   optionally wherein the human VpreB gene is operably linked             to an endogenous VpreB promoter or its associated human             promoter; and     -   (d) an endogenous λ5 gene capable of expressing an endogenous         λ5;         -   wherein endogenous antibody heavy and light chain expression             has been inactivated; and         -   optionally wherein said genome is homozygous for said heavy             and light chain transgenes. -   83. A transgenic mouse or rat, or a transgenic mouse or rat cell     (eg, an ES cell), whose genome comprises     -   (a) an antibody heavy chain transgene, the transgene comprising         a substantially complete human functional VH, D and JH         repertoire operably connected upstream of an endogenous (mouse         or rat) mu constant region gene so that the transgene is capable         of undergoing VDJ recombination in vivo to produce an antibody         gene comprising a rearranged VDJC encoding an antibody heavy         chain having a human variable region and an endogenous mu         constant region;         -   wherein all of the V, D and J in said transgene are human V,             D and J;         -   wherein the heavy chain transgene is devoid of a CH1 gene             segment (eg, a gamma CH1) and the genome of the vertebrate             comprises no functional antibody light chain locus;         -   wherein endogenous antibody heavy chain expression has been             inactivated; and         -   optionally wherein said genome is homozygous for said heavy             chain transgene;     -   (c) a human VpreB gene capable of expressing a human VpreB;         -   optionally wherein the human VpreB gene is operably linked             to an endogenous VpreB promoter or its associated human             promoter; and     -   (d) optionally an endogenous λ5 gene capable of expressing an         endogenous λ5.     -   As demonstrated in Example below, the present invention is         useful for influencing the repertoire of human heavy chain         variable domains and variable region DNA and RNA sequences that         can be produced in a non-human vertebrate. Thus, the invention         provides:— -   84. A mouse according to any one of aspects 1 to 58 or 75 to 83,     which expresses a repertoire of Ig heavy chain variable regions that     significantly differs (eg, as indicated by a probability of less     than 0.05 in a chi-squared test) from the heavy chain variable     region repertoire of a control vertebrate in the proportion (eg,     percentage) of use of heavy chain variable gene segments, wherein     the control and said non-human vertebrate genomes are of the same     background mouse strain and both comprise said antibody heavy chain     transgene, the transgenes being identical in the control and said     mouse, and wherein the control does not express a human VpreB.     -   In an example of any aspect, the mouse and control strains are         129 or 129-derived (eg, 129 crossed with C57BL/6).     -   In an example of any aspect, each repertoire is a repertoire of         heavy chain variable region RNA. Optionally, the variable         regions are provided as part of IgH heavy chains or         corresponding RNA.     -   In an example of any aspect, each transgene comprises at least         11 human gene segments, all or substantially all human D gene         segments and all or substantially all human JH gene segments.         For example, each transgene comprises human D1-1, 2-2, 3-9,         3-10, 4-11, 5-12, 6-13, 1-14, 2-15, 3-16, 4-17, 5-18, 6-19,         1-20, 2-21, 3-22, 4-23, 5-24, 6-25, 1-26 and 7-27. For example,         each transgene comprises human J1, J2, J3, J4, J5 and J6. For         example, each transgene comprises (optionally in 5′ to 3′ order)         VH3-13, 3-11, 3-9, 1-8, 3-7, 2-5, 7-4-1, 4-4, 1-3, 1-2 and 6-1. -   85. A mouse according to any one of aspects 1 to 58 or 75 to 84,     which expresses a repertoire of Ig heavy chain variable regions that     significantly differs (eg, as indicated by a probability of less     than 0.05 in a binomial distribution test) from the IgH heavy chain     variable region repertoire of a control vertebrate in the proportion     (eg, percentage) of use of heavy chain JH gene segments, wherein the     control and said non-human vertebrate genomes are of the same     background mouse strain and both comprise said antibody heavy chain     transgene, the transgenes being identical in the control and said     mouse, and wherein the control does not express a human VpreB. -   86. A mouse according to any one of aspects 1 to 58 or 75 to 85,     which expresses a repertoire of Ig heavy chain variable regions that     significantly differs (eg, as indicated by a probability of less     than 0.05 in a binomial distribution test) from the IgH heavy chain     variable region repertoire of a control vertebrate in the proportion     (eg, percentage) of use of one, more or all heavy chain variable     gene segments selected from VH6-1, VH1-3, VH3-7, VH1-8, VH3-9,     VH3-11, JH1 and JH6, wherein the control and said non-human     vertebrate genomes are of the same background mouse strain and both     comprise said antibody heavy chain transgene, the transgenes being     identical in the control and said mouse, and wherein the control     does not express a human VpreB. -   87. The mouse of any one of aspects 84 to 86, wherein use of one,     more or all of VH6-1, VH1-3, VH3-9 and JH1 is higher in the     repertoire of said mouse than in the repertoire of the control. -   88. The mouse of any one of aspects 84 to 87, wherein use of one,     more or all of VH3-7, VH1-8, VH3-11 and JH6 is lower in the     repertoire of said mouse than in the repertoire of the control.     -   In an example of any of aspects 84 onwards, the IgH heavy chain         variable region repertoire is naïve, ie, has not been         preselected against a predetermined target antigen.     -   In an example of any of aspects 84 onwards, the scope is not         limited to a mouse, but the scope of the aspect relates to a         non-human vertebrate (eg, a mouse or a rat). Aspects 84 onwards         can in the alternative, therefore, be read with this scope and         for possible inclusion in claims herein.

It will be understood that particular embodiments described herein are shown by way of illustration and not as limitations of the invention. The principal features of this invention can be employed in various embodiments without departing from the scope of the invention. Those skilled in the art will recognize, or be able to ascertain using no more than routine study, numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the scope of this invention and are covered by the claims. All publications and patent applications mentioned in the specification are indicative of the level of skill of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference. The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.” The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and “and/or.” Throughout this application, the term “about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects.

As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”) or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended and do not exclude additional, unrecited elements or method steps

The term “or combinations thereof” as used herein refers to all permutations and combinations of the listed items preceding the term. For example, “A, B, C, or combinations thereof is intended to include at least one of: A, B, C, AB, AC, BC, or ABC, and if order is important in a particular context, also BA, CA, CB, CBA, BCA, ACB, BAC, or CAB. Continuing with this example, expressly included are combinations that contain repeats of one or more item or term, such as BB, AAA, MB, BBC, AAABCCCC, CBBAAA, CABABB, and so forth. The skilled artisan will understand that typically there is no limit on the number of items or terms in any combination, unless otherwise apparent from the context.

Any part of this disclosure may be read in combination with any other part of the disclosure, unless otherwise apparent from the context.

All of the compositions and/or methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and/or methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.

The present invention is described in more detail in the following non limiting prophetic exemplification (Examples 1-5). An additional example, Example 6, not prophetic, but was actually performed and yielded data whose results demonstrate the present invention.

EXAMPLES

Example 1: Introduction of Human DNA Gene Segments into ES Cells

By way of illustration, the following example is provided on the introduction of human antibody gene segment DNA into Cell culture of C57BL/6N-derived cell lines, such as the JM8 male ES cells. This will follow standard techniques. The JM8 ES cells have been shown to be competent in extensively contributing to somatic tissues and to the germline, and are being used for large mouse mutagenesis programs at the Sanger Institute such as EUCOMM and KOMP (Pettitt, S. J., Liang, Q., Rairdan, X. Y., Moran, J. L., Prosser, H. M., Beier, D. R., Lloyd, K. C., Bradley, A., and Skarnes, W. C. (2009). Agouti C57BL/6N embryonic stem cells for mouse genetic resources. Nature Methods.). JM8 ES cells (1.0×10⁷) will be electroporated (500 μF, 230V; BioRad) with 10 μg I-Scel linearized human BAC DNA. The transfectants will be selected with either Puromycin (3 μg/ml) or G418 (150 μg/ml). The selection will begin either 24 hours (with G418) or 48 hours (with Puromycin) post electroporation and proceed for 5 days. 10 μg linearized human BAC DNA can yield up to 500 Puromycin or G418 resistant ES cell colonies. The antibiotic resistant ES cell colonies will be picked into 96-well cell culture plates for genotyping to identify the targeted clones.

Once targeted mouse ES cell clones are identified, they will be analyzed by array Comparative Genomic Hybridization (CGH) for total genome integrity (Chung, Y. J., Jonkers, J., Kitson, H., Fiegler, H., Humphray, S., Scott, C., Hunt, S., Yu, Y., Nishijima, I., Velds, A., et al. (2004). A whole-genome mouse BAC microarray with 1-Mb resolution for analysis of DNA copy number changes by array comparative genomic hybridization. Genome research 14, 188-196. and Liang, Q., Conte, N., Skarnes, W. C., and Bradley, A. (2008). Extensive genomic copy number variation in embryonic stem cells. Proceedings of the National Academy of Sciences of the United States of America 105, 17453-17456.). ES cells that have abnormal genomes do not contribute to the germline of the chimaeric mice efficiently. BAC integrity will be examined by PCR-amplifying each known functional V gene in the BAC. For example, in one approach the first human BAC chosen for the IgH locus has 6 functional VH gene segments and all human D and JH gene segments. To confirm the integrity of this BAC for the presence of these IgH genes, pairs of PCR primers will be designed and used to PCR-amplify genomic DNA from the targeted ES cells. The human wild-type size and sequence of these fragments will ensure that the inserted BAC has not been rearranged. More detailed CGH will also confirm the integrity of the inserted BACs. For example, one skilled in the art could use an oligo aCGH platform, which is developed by Agilent Technologies, Inc. This platform not only enables one to study genome-wide DNA copy number variation at high resolution (Barrett, M. T., Scheffer, A., Ben-Dor, A., Sampas, N., Lipson, D., Kincaid, R., Tsang, P., Curry, B., Baird, K., Meltzer, P. S., et al. (2004). Comparative genomic hybridization using oligonucleotide microarrays and total genomic DNA. Proceedings of the National Academy of Sciences of the United States of America 101, 17765-17770.), but permit examination of a specific genome region using custom designed arrays. Comparing the traditional aCGH techniques which rely on cDNA probes or whole BAC probes, the 60-mer oligonucleotides probes can ensure specific hybridization and high sensitivity and precision that is needed in order to detect the engineered chromosome alterations that we have made. For example, oligos designed to hybridize at regular intervals along the entire length of the inserted BAC would detect even quite short deletions, insertions or other rearrangements. Also, this platform provides the greatest flexibility for customized microarray designs. The targeted ES cell genomic DNA and normal human individual genomic DNA will be labelled separately with dyes and hybridized to the array. Arrays slides will be scanned using an Aglient Technologies DNA microarray scanner. Reciprocal fluorescence intensities of dye Cy5 and dye Cy3 on each array image and the log 2 ratio values will be extracted by using Bluefuse software (Bluegnome). Spots with inconsistent fluorescence patterns (“confidence”<0.29 or “quality”=0) will be excluded before normalizing all log 2 ratio values. Within an experiment, Log 2 ratio between −0.29 and +0.29 for the signal from any oligo probe are regarded as no copy number change. The log 2 ratio threshold for “Duplication” is usually >0.29999, and for deletion is <0.29999.

Example 2: Introducing the Human VpreB Gene into the Mouse Lambda Locus by Inserting the Human Lambda V Gene Segment Cluster

The human VpreB gene is inserted into a mouse genome by introducing human lambda gene segment DNA, wherein the nucleotide sequence of the DNA comprises a human VpreB gene and optionally the associated human promoter. Human VpreB1 and VpreB3 genes are both located in chromosome 22. The human VpreB1 gene is between human Igλ V gene segments IV-53 and 5-52 (Chromosome 22: 22599200 to 22599926). Therefore, the human lambda gene segment which covers the human VpreB1 gene and contains multiple Vs flanking this gene is inserted in the endogenous lambda antibody locus upstream of the endogenous lambda constant region between the most 3′ endogenous JA and the endogenous CA following the approach in Example 1 and as further explained in WO2011004129.

Example 3: Introducing the Human VpreB Gene within the Mouse VpreB-λ5 Locus and Targeting into the Mouse Heavy or Lambda Locus

The following example illustrates the insertion of the human VpreB1 gene within the mouse VpreB-λ5 locus to bring the human gene under the endogenous LCR influence. The mouse VpreB1 and λ5 genes are closely associated, both within a 19-kb fragment called the λ5-VpreB1 locus. Apart from these two genes, this locus contains a locus control region (LCR) required for correct levels of expression and tissue-specificity of λ5 and VpreB1. Locus control regions (LCRs) are sequences that mediate reorganisation of chromatin and activation of transcription by sequence-specific transcription factors. The characteristic of an LCR is the ability to drive gene expression in transgenic mice at any site of integration at levels that are substantially equivalent to those of the gene in its natural location. It has been shown that the LCR in the λ5-VpreB1 locus is able to promote efficient and stage-specific expression of both genes in transgenic mice at all sites of integration tested.

To make sure that the expression of the inserted human VpreB1 gene is properly controlled in mouse, the mouse λ5-VpreB1 locus is retrieved and cloned into a vector (pBlueScript II SK(+)). The mouse VpreB1 nucleotide sequence is replaced by the human VpreB1 sequence by cloning. The construct containing the fragment of mouse λ5-human VpreB1 locus is then cloned into a BAC containing a human heavy chain locus fragment. Then it is targeted into the genome of a mouse ES cell following the introduction of the human BAC into the mouse heavy chain locus upstream of the endogenous heavy chain mu constant region. In this case, the human VpreB function can be directly tested in the F1 mouse generated from the ES cells. Manipulation of constructs and BACs can be effected by standard recombineering.

Insertion of the mouse λ5-human VpreB1 can be carried out using homologous recombination or RMCE. Homologous combination can employ homology arms corresponding to the endogenous sequences flanking the mouse λ5-mouse VpreB1 in the mouse ES genome, so that insertion of the mouse λ5-human VpreB1 replaces (and thus deletes) the endogenous mouse λ5-mouse VpreB1 and places the inserted construct under the control of the endogenous LCR.

RMCE can be used to insert the mouse λ5-human VpreB1 construct (optionally with the endogenous mouse λ5-mouse VpreB1 LCR) anywhere in the genome. RMCE can also be used to precisely delete the endogenous mouse λ5-mouse VpreB1 so that the inserted mouse λ5-human VpreB1 construct is the only source of a VpreB1 gene.

In the alternative, the method can be modified to insert the mouse lambda 5-human VpreB1 construct into the endogenous lambda5-VpreB locus on the mouse lambda locus, for example to bring the human VpreB1 gene under endogenous control (eg, control of the lambda5-VpreB LCR). To do this, the cloned mouse lambda 5-human VpreB1 construct (produced by standard recombineering) is inserted into a homologous recombination vector using in vitro recombineering to add homology arms flanking the construct. Using homologous recombination in the presence of a marker carried by the vector, the construct is inserted precisely to replace the endogenous mouse lambda 5-mouse VpreB1 in the genome of a mouse ES cell. Alternatively, instead of homologous recombination, standard RMCE can be used to effect precise insertion into the genome.

A suitable protocol is provided in Example 5.

Example 4: Assessing the B-Cell Development and Repertoire in Chimaeric Mouse Containing Human VpreB Gene

The following example is provided on assessing the advantage of having the human VpreB gene in the chimaeric mouse.

The mouse embryonic stem cells (eg, AB2.1 cells; Baylor College of Medicine) containing the targeted human VpreB1 gene are grown on a feeder layer of SNL7 fibroblasts (Baylor College of Medicine) in embryonic stem cell medium containing 15% serum. Then the cells are microinjected into blastocysts which are transferred to the uteri of pseudopregnant F₁ female mice. To test for germline transmission, male chimeras are bred to C57BL/6-Tyr^(c-Brd) albino female mice (Baylor College of Medicine). After obtaining mice with both chimaeric heavy chain gene (human V region and mouse C region) and human VpreB1 gene, the following assays are performed to show the advantage of human VpreB v mouse VpreB.

1. Pre-B Cell Populations Containing Pre-BCR from Human VpreB v Mouse VpreB

In the pre-B cells, μ heavy chain associated with surrogate light chain will deposit on the cell surface to signal the cell survival and proliferation. The pre-B cell populations with the pre-BCR from human VpreB protein v mouse VpreB protein are analyzed by flow cytometry.

Cells from bone marrow are surface stained with fluorescence-labelled mAb and then analysied with MACSQuant (Miltenyi Biotech) on B220⁺ cells. Pre-B cell population can be detected using pre-BCR-specific mAB and/or mouse λ5⁺ cells. The cells with pre-BCR of human VpreB v mouse VpreB are distinguished by using monoclonal antibody specifically to human or mouse VpreB.

2. Usage of Human Vs in the Mouse with Chimaeric Antibody

In the mouse with the insertion of human V_(H)D_(H)J_(H)s between endogenous mouse V_(H)D_(H)J_(H)s and C_(H)s, the usage of human Vs v endogenous mouse Vs are compared by sequencing the heavy chain products in naive mice.

Approach:

-   -   a. Total RNA extraction (use TRIzol® Reagent) from mouse spleen     -   b. By a standard 5′ RACE (Rapid amplification of 5′         complementary DNA ends) method, using a mouse Cμ-specific         primer, PCR fragments from mouse synthesized heavy chain mRNAs         can be amplified.     -   c. The PCR products are cloned into a vector and then sequenced.     -   d. The sequences map to heavy V D J sequences which contain         either mouse V or human V. Then we compare the usage of human Vs         v mouse Vs.     -   e. The usage of human Vs v mouse Vs is analysed in the mice         with/without a human VpreB1 gene. We expect by introducing human         VpreB1 gene, the human V usage will increase in the mouse with         chimaeric IgH gene.

3. Improvement of Repertoire and Diversity of Chimaeric Antibody

In this embodiment, in mice with the chimaeric heavy locus, endogenous mouse V genes are inactivated by deletion or inversion of the mouse V locus. Therefore, all the heavy chain products are transcribed from human Vs.

Using the same methods in Example 3, the heavy chain products from naïve and/or immunized mice are sequenced and map to human V_(H)D_(H)J_(H)S with different CDR sequences.

The repertoire of heavy V D J selected by surrogate light chain from human VpreB and mouse VpreB can be compared from several aspects. First is the improvement of usage of human heavy V D Js and hypervariabilities for mice having human VpreB. Second is the shift of usage in terms of human heavy V gene families caused by human VpreB v mouse VpreB.

Example 5: Surrogate Light Chain Targeting Protocol

Mouse λ5 and VpreB1 sequences will be substituted with human λ5 and VPREB1 sequences. The regulation and expression will be controlled by mouse endogenous regulatory elements (FIG. 1; references 1 & 2) that are used during B cell development. In a mouse genome this VpreB1 is found on chromosome 16 at coordinates 16,868,494-16,869,348. In a human genome VPREB1 is found on chromosome 22 at coordinates 22,599,087-22,599,927.

REFERENCES

-   1. Minaee S, Farmer D, Georgiou A, Sabbattini P, Webster Z, Chow C     M, Dillon N, Mapping and functional analysis of regulatory sequences     in the mouse lambda5-VpreB1 domain, Mol Immunol. 2005 July;     42(11):1283-92. -   2. Sabbattini P, Georgiou A, Sinclair C, Dillon N., Analysis of mice     with single and multiple copies of transgenes reveals a novel     arrangement for the lambda5-VpreB1 locus control region, Mol Cell     Biol. 1999 January; 19(1):671-9.

Insertion of Human VpreB1 into Mouse BAC

A bacterial artificial chromosome (BAC) containing mouse λ5 and VpreB1 genes will be used for recombineering in bacteria (E. coli) to replace mouse genes with human genes. A suitable source of BACs is mouse BAC library collection, RP23-220N17 (BacPac Resource Center; http://bacpac.chori.org). The following steps will be followed to generate a targeting construct that will be used further for targeting of DNA into mouse AB2.1 ES cells and generation of a transgenic mouse harbouring the human sequences.

The first step is to delete the mouse VpreB1 gene on a BAC containing this mouse gene, by using a positive/negative selection cassette flanked by 5′ and 3′ 50 bp homology arms (ie, stretches of mouse genomic sequence that flank the endogenous VpreB1 gene in the mouse genome—see sequences A and B below and denoted as BAC Arms A and B respectively in FIG. 1B). The cassette contains ampicillin and streptomycin genes between the homology arms. This cassette is able to select clones that have been correctly targeted, FIGS. 1A and 1B. The selection cassette will be inserted and targeted using homologous recombination to replace the endogenous mouse VpreB1 gene. Positive clones will be checked using positive selection, ampicillin, followed by PCR based confirmation (denoted by the arrows on the second schematic in FIG. 1A and by means of PRIMER1 to PRIMER4 in FIG. 1B). As shown, the PCR based confirmation will be performed using specific primers situated within the inserted DNA sequence whereas, other primers will be outside the targeted region. A positive amplification product can only be achieved if the newly inserted DNA is targeted correctly to the desired locus. Using 50 bp homology arms (sequences A and B below) that are added to flank the human VPREB1 gene, a vector will be constructed and used to target and replace the positive/negative selection cassette specifically using homologous recombination (bottom schematics in FIGS. 1A and B). Correctly targeted clones containing human VPREB1 (targeted mouse BACs) will be screened using a negative selection, streptomycin and PCR based method. A source of human DNA for the human VPREB1 gene is RP11-373H24 BAC (Invitrogen; Roswell Park Cancer Institute).

Sequence A: 5′ Arm acctggccaaactgagcatgacctttgacctagccagctc ttaaacttgttctgagatcacaaaccagccagaccaaatt Sequence B: 3′ Arm tcctcccagaatgcttccctgggtcaaacccagagccaca aaggcttccattagaccattctggtaagtgacagagtcac

Retrieval of 19 kb DNA from Mouse BAC

Next, the 19 kb of DNA (see Sabbattini references cited supra) containing human VPREB1, mouse λ5 and the mouse regulatory elements will be removed from the BAC using homologous recombination in E coli. The region will be retrieved from the previously targeted mouse BAC into a smaller vector containing 5′ and 3′ mouse homology arms (FIG. 2; Mouse Arms 1 and 2). The arms are constructed using mouse BAC as a DNA template to maintain high nucleotide homology. Amplified arms are digested with specific restriction enzymes and cloned into targeting vector (the resulting targeting vector is shown in the top schematic in FIG. 2). The targeting vector is linearised between two homology arms and electroporated into E. coli which contains the previously targeted human VPREB1 gene.

Mouse Arm 1 Sequence: AAAAAAAAAAAAGCCCAGCTAGCTTAGTTGGTAGAGCATGAGACTCTT AATCTCAGAGTCATGGGTTCAGGCCTCATGTTTGGCACCATCTATAGT GTGCAGTTATAAATCAAACAGTTCACGATGTGGCTGGCTAGGCACTGG CAACTGCAGTCTCACCTGCTCCCATGGTTCCCAGTTCCCACAGCTAGT TTGCTGCCAGGCTGTTCACACTTCCCAGGTCATCTACCCACTGTGGCA AGCCTGCAGAAAGCCTGCTATTGCTAGCTCAGATTCCCTTAGCCCTAT AAAATGATAACACCACAGACTTTTACTATACCATCCAGATTTATAACA GTAATTCTCCAACCC Mouse Arm 2 Sequence: CAGGAGTACACCAAAATGTCTAGCCAAAATTTTTATATATGATCACTT AAATAAGACTCCTTAACATAAACCTACATGATATACCAAGTCTTTTCT GCCAAGGCTCTGACACTATAGTTTGTCCTATTCTGGAGTTAGGTAAGC AAAGGGCTATTTAGGTGTGGATTGCAAAGAGAGAATAGCAAGACAACC TGCCCATTCTTTGCCACACCTCACTAATCAGTGTCCCTTGGAAAGCAC TGTAAATATGGAGGTTTCTTTTTGTATTATGTAGTAGTGGATTTAACT TGAGGAGCCCCAAAAGGGGTCAGCAAAGCATGGGAAATCTAAGAATTT AACACTTCAGTGACTTTTAATCACCTACAGATCCAGGAAAATAAGCCT GTCTCTT

The resulting vector contains the 19 kb sequence containing mouse λ5 and human VPREB1 genes (second schematic in FIG. 2). This vector will allow for amplification of the retrieved 19 kb sequence and use for further insertion into a human BAC of choice (eeg, a BAC containing human antibody gene segments, such as a BAC obtainable from the RPCI-11 library available from Invitrogen). The insertion of the 19 kb DNA fragment will be targeted into a BAC containing human IgH V gene segments (FIGS. 3A and 3B; which can be read in conjunction with the disclosure of WO2011004192, the disclosure of which is incorporated herein by reference), eg, to target the 19 kb sequence directly 5′ of the most distal 5′ end of the human IgH locus to avoid any possible interference with normal IgH rearrangement subsequently in a mouse or B-cell progeny. To this end, using a series of BACs, human Ig gene segments (VDJ) are inserted in a series of steps into the genome of a mouse ES cell (eg, an AB2.1 cell) using sequential recombinase mediated cassette exchange (sRMCE) or homologous recombination as is known in the art. The 19 kb sequence can be inserted directly 5′ of the 5′ VH in the last BAC to be used, thus providing the 19 kb sequence immediately 5′ of the first human VH in a transgenic IgH locus after insertion of the 19 kb sequence and human VH sequences into the mouse ES cell genome. Thus, a human VPREB1/mouse λ5 gene insert is provided for expression of chimaeric surrogate light chain (SLC) in a subsequent progeny mouse or B-cell.

Humanisation of λ5

Similar procedures of targeting and genetic engineering steps will be used to modify the second gene of SLC, namely the λ5 (FIG. 4), in order to replace mouse λ5 with human λ5.

Correctly targeted ES will be used to generate a transgenic mouse. Analyses will be performed using animals which harbour (i) the human VPREB1 and mouse λ5 (humanVB1/mouse λ5) or (ii) human VPREB1 and human λ5 (human VB1/λ5). Differences in populations of expressed antibodies between a control wild type (WT) mouse and (i) or (ii) will be recorded using Fluorescence-Activated Cell Sorting (FACS). The end point data will be generated using mice spleen to analyse the contribution of human VPREB1 only or in conjunction with human λ5 in expansion of B cell population harbouring human sequences and the generation of more diverse antibody repertoires.

Example 6: Engineering of Non-Human Vertebrate Genome—Chimaeric Surrogate Light Chains Significantly Change Human Variable Region Repertoires In Vivo

Construction of Transgenic IgH Loci

Insertion of human heavy gene segments from a 1st IGH BAC (RP11-1065N8) into the IGH locus of mouse AB2.1 ES cells (Baylor College of Medicine) was performed to create a heavy chain allele denoted the 51 allele. The inserted human sequence corresponds to the sequence of human chromosome 14 from position 106494908 to position 106328951 and comprises functional heavy gene segments V_(H)2-5, V_(H)7-4-1, V_(H)4-4, V_(H)1-3, V_(H)1-2, V_(H)6-1, D1-1, D2-2, D3-9, D3-10, D4-11, D5-12, D6-13, D1-14, D2-15, D3-16, D4-17, D5-18, D6-19, D1-20, D2-21, D3-22, D4-23, D5-24, D6-25, D1-26, D7-27, J_(H)1, J_(H)2, J_(H)3, J_(H)4, J_(H)5 and J_(H)6 (in 5′ to 3′ order), wherein the JH6 was chosen to be the human JH6*02 variant. The insertion was made between positions 114666435 and 114666436 on mouse chromosome 12, which is upstream of the mouse Cμ region. The mouse V_(H), D and J_(H) gene segments were retained in the locus, immediately upstream of (5′ of) the inserted human heavy chain DNA.

A second allele, S2 was constructed in which more human functional V_(H) gene segments were inserted upstream (5′) of the 5′-most V_(H) inserted in the 51 allele by the sequential insertion of human DNA from a second BAC (BAC2). The inserted human sequence from BAC2 corresponds to the sequence of human chromosome 14 from position 106601551 to position 106494909 and comprises functional heavy chain gene segments V_(H)3-13, V_(H)3-11, V_(H)3-9, V_(H)1-8, V_(H)3-7. The mouse V_(H), D and J_(H) gene segments were retained in the locus, immediately upstream of (5′ of) the inserted human heavy chain DNA.

Mice bearing the S2 insertion into an endogenous heavy chain locus were generated from the ES cells using standard procedures. The other endogenous heavy chain locus was inactivated in the mice by insertion of an inactivating sequence comprising neo^(n) into the mouse J_(H)-Cμ intron (to produce the “HA” allele). The mice retained mouse VpreB and λ5, but no human VpreB gene (control, S2 mouse) or a human VpreB gene was included in the genome (S2/hVpreB mouse, ie, chimaeric surrogate light chain mouse according to the invention comprising a human VpreB1 gene with mouse VpreB1 promoter-driven expression). The human VpreB was inserted from a BAC using recombinase mediated cassette exchange (RMCE) upon construction of the S2/hVpreB mouse according to the invention. The latter mouse was a chimaeric mouse having around 30% Agouti coat colour, which is indicative of the potential fraction of B cells derived from the injected ES cells bearing the human VpreB gene. Thus, in the chimaeric mouse of the invention most B-cells do not have the human VpreB. This provides for a more challenging test versus control, as explained below. An alternative would be to breed a mouse of the invention where all cells harbour a human VpreB gene, and this is expected to show the advantage of the invention in producing a different repertoire as demonstrated below for the chimaeric test mouse.

Specifically, the following human gene segments were included (in 5′ to 3′ order):—

-   -   human VH gene segments VH3-13, 3-11, 3-9, 1-8, 3-7, 2-5, 7-4-1,         4-4, 1-3, 1-2 and 6-1;     -   human D gene segments D1-1, 2-2, 3-9, 3-10, 4-11, 5-12, 6-13,         1-14, 2-15, 3-16, 4-17, 5-18, 6-19, 1-20, 2-21, 3-22, 4-23,         5-24, 6-25, 1-26 and 7-27; and.     -   Human JH gene segments J1, J2, J3, J4, J5 and J6.

RNA was extracted from the spleens of both the control and S2/hVpreB mouse of the invention. RT-PCR was performed as described in the VDJ sequencing methodology below. The mouse Cμ transcripts were captured and sequenced, and only human JH transcripts were used for usage analysis to ensure use of transcripts only derived from the transgenic IgH loci bearing human variable region gene segments.

VDJ-Sequencing Methodology

The following methodology was performed to sequence human IgH variable regions from RNA samples obtained from the mouse of the invention comprising a transgenic IgH locus and a human VpreB gene. Comparison of the control with a chimaeric mouse of the invention is more challenging than comparison with a mouse of the invention that is 100% transgenic (ie, where all cells bear the transgenic IgH locus and human VpreB gene). In the chimaeric mouse, only a proportion of the cells comprise the transgenic IgH locus and human VpreB gene, and in those cells the IgH is present in one copy (heterozygous). Thus, the comparison in this example surprisingly was still able to show a difference in repertoires produced in the control and invention mice despite the chimaeric nature of the latter.

Sequences were filtered so that the resulting data set related to human variable regions only (and not variable regions having mouse sequence).

V gene usage frequency was assessed by next generation sequencing of VDJ recombined transcripts using the Illumina Miseq platform (see, eg, Nature. 2008 Nov. 6; 456(7218):53-9; “Accurate whole human genome sequencing using reversible terminator chemistry”; Bentley D R et al). Libraries of VDJ sequences were generated with standard adapter sequences at either end that were compatible with Illumina sequencers. The Illumina adapter sequences are termed P5 and P7 permit the binding of DNA fragments to the Illumina flowcell and are the initiation sites of the sequencing reactions. The VDJ libraries were generated using the following methodology: 5′ RACE was performed on 5 μg of total RNA to generate cDNA with a known sequence at the 5′ end using the ExactSTART Eukaryotic mRNA 5′- & 3′ RACE kit (Epicentre) using the manufacturer's protocol with the following modifications: The 5′ RACE acceptor oligo was replaced with an RNA oligo (P7 RNA oligo: AGACGUGUGCUCUUCCGAUCU) specific to a 21 bp portion of the Illumina P7 adapter. The first-strand cDNA was synthesised using a reverse primer (IgM RT primer: GAAGACATTTGGGAAGGACTG) specific to the first exon of the IgM constant region. Second strand synthesis, and enrichment of VDJ sequences was achieved by performing 15 cycles of PCR using a primer specific to the ligated RNA oligo (P7 PCR 1: [GTGACTGGAGTTC]AGACGTGTGCTCTTCCGATCT) that also included additional Illumina P7 sequence at the 5′ end (shown by square brackets). Paired with a primer specific to the IgM constant region/J segment splice junction (IgM P5 PCR 1: [ACACTCTTTCCCTACACGACGCTCTTCCGATCTNN]GGGAAGGACTGACTCTCTGA) that also included additional Illumina P5 sequence, and two bases of random nucleotides, at the 5′ end (shown by square brackets).

The PCR reaction was purified using gel electrophoresis and the major PCR product that corresponded to VDJ transcripts (˜600 bp) was excised and purified using a Qiagen gel extraction kit according to the manufacturer's protocol. A second round of 15-cycle PCR was performed to add the remaining portions of the Illumina P5 & P7 adapter sequence and to incorporate a barcode, so multiple samples could be pooled and sequenced together in a single run. The forward primer was specific to the P5 sequence with the additional P5 flowcell binding sequence shown within square brackets (P5 PCR 2: [AATGATACGGCGACCACCGAGATCT]ACACTCTTTCCCTACACGACGCTCTT). The reverse primer was specific to the P7 sequence and incorporated a 6 bp barcode and the P7 flowcell binding sequence shown within square brackets (P7 PCR 2 index X: [CAAGCAGAAGACGGCATACGAGAT******]GTGACTGGAGTTCAGACGTGT). The 6 bp barcodes were standard Illumina 6 bp indexes, 4 different indexes were used in this case and the sequences were: CGTGAT, GCCTAA, GATCTG, & TCAAGT. The final PCR products were purified using AMPure XP beads (Beckman Coulter) and were pooled together and sequenced on the consisted of a VDJ transcribed gene with a complete P5 and P7 Illumina adapter on either end. This construct was suitable for next generation sequencing on the Illumina Mi-Seq system using the TruSeq DNA sequencing program2×150 bp paired end sequencing program.

Bioinformatics & Statistical Analysis

The sequences from the Illumina MiSeq were obtained as a set of 4 BAM files from the machine, each file corresponding to a different barcode. Barcodes 1 and 3 were from the surrogate light chain samples, while barcodes 7 and 8 were from the control samples. These BAM files were converted into fastq using bam2fastq using the command: bam2fastq <file>.bam -o<file>#.fastq which splits the paired reads into two different fastq files.

These fastq files were then converted into fasta using fastq_to_fasta -I<file>.fastq -o<file>.fasta -Q33. This yielded a set of 8 fasta files.

The fasta files were then analysed with Basic Local Alignment Search Tool (BLAST) (http://blast.ncbi.nlm.nih.gov/) to find J and V segments from mice and humans. The paired reads were used to identify the J and V used, with the read from the constant region end being used to find the J and the read from the 5′ end of the RNA being used to identify the V gene used. As the transcription occurred before the start of the V segment, 5′ UTR sequence from Ensemble (http://www.ensembl.org) was used to search against to find the V genes used.

An InforSense workflow was used to then combine the results from the paired reads by matching their unique identifier to the results from the two Blast searches. These combined results were then filtered for sequences which used a Human J gene and a Human V gene. Because the sample with the chimaeric surrogate light chain was a wild type/transgenic chimaera, there were far fewer samples with a Human J than in the control, which is expected.

To avoid any errors due to PCR duplication events sequences with identical CDR3 regions, as determined by the IMGT numbering scheme, were treated as one sequence. This reduced the number of sequences in the control sample from 2,632,503 to 168,551 and in the surrogate light chain sample from 2,543,248 to 590,462.

The final counts after filtering for Human J, Human V and treating each CDR3 string as single sequence gave 10,781 sequences for the control, and 267 for the surrogate light chain sample. The low percentage of sequences with an identified Human V is due to the 150 bp reads from the Illumina MiSeq not extending very far into the variable part of the V genes. The results are shown in Table 3. In the table, the mouse of the invention is labelled “Chimaeric SLC” and significant changes in gene segment usage are underlined.

TABLE 3 Chimaeric Chimaeric Control Total SLC Total Control % SLC % V6-1 587 38 5.44% 14.23% V1-2 155 3 1.44%  1.12% V1-3 1167 43 10.82%  16.10% V4-4 984 33 9.13% 12.36% V2-5 409 7 3.79%  2.62% V3-7 4095 66 37.98%  24.72% V1-8 1315 21 12.20%   7.87% V3-9 893 40 8.28% 14.98% V3-11 443 3 4.11%  1.12% V3-13 733 13 6.80%  4.87% sum 10781 267 IGHJ1*01 131 13 1.21%  4.87% IGHJ2*01 260 13 2.41%  4.87% IGHJ3*02 1011 34 9.37% 12.73% IGHJ4*02 5467 153 50.69%  57.30% IGHJ5*02 867 12 8.04%  4.49% IGHJ6*02 3050 42 28.28%  15.73% sum 10786 267

The usage between the two samples was compared using a standard Pearson's chi-squared test (see eg, NIST/SEMATECH e-Handbook of Statistical Methods, http://www.itl.nist.gov/div898/handbook/, April 2012). This gave the probabilities that the distribution of the surrogate light chain sample and the control sample being the same was 5.8E-15 for the V genes and 8.2E-12 for the J genes. These are below the standard 5% (ie, 0.05) probability cut off and thus are significant.

The individual V and J genes which showed the most significant differences between the two are summarised in the Table 4 below. The probability shown is the probability of obtaining by chance that number or more in the case of an increase or that number or less in the cast of a decrease, using a binomial distribution using the probabilities from the ratios obtained from the control sample. The individual V and J genes in the table have a probability score below the standard 5% (ie, 0.05) probability cut off and thus are significant.

TABLE 4 Expected Chimaeric from Gene Usage SLC Count Control Probability IGHV6-1 Increased 38 14.5 2.38E−08 IGHJ1 Increased 13 3.2 6.31E−06 IGHV3-9 Increased 40 22.1 9.80E−05 IGHV1-3 Increased 43 28.9 3.21E−03 IGHJ6 Decreased 42 75.5 1.09E−06 IGHV3-7 Decreased 66 101.4 2.94E−06 IGHV3-11 Decreased 3 11.0 4.42E−03 IGHV1-8 Decreased 21 32.574 1.52E−02

SEQUENCE LISTING SEQ ID NO: 1 Human VpreB1 nucleotide sequence (GenBank Accession No = NG_029387.1) gagtcagagctctgcatgtctgcaccatgtcctgggctcctgtcctgctcatgctgtttgtctactgcacaggtgaggga acccccagatcccaaagactcctgccccttccttcatcctgccctgcccccacggcccacatgcatctgtgtcaccaggt tgtggtcctcagccggtgctgcatcagccgccggccatgtcctcggcccttggaaccacaatccgcctcacctgcaccct gaggaacgaccatgacatcggtgtgtacagcgtctactggtaccagcagaggccgggccaccctcccaggttcctgctga gatatttctcacaatcagacaagagccagggcccccaggtcccccctcgcttctctggatccaaagatgtggccaggaac agggggtatttgagcatctctgagctgcagcctgaggacgaggctatgtattactgtgctatgggggcccgcagctcgga gaaggaggagagggagagggagtgggaggaagaaatggaacccactgcagccaggacacgtgtcccttgaactgaagaca gcagaggcacgcatccccttggagagactgtcatggaagagggtggagccgccgcccgaagcgccgaggaggctgagcca ctcagcatctcctggtcctgcagtgttgctgtaaatccccattggagactgcattagggaattaaagctgcttgtcactt tttgctg SEQ ID NO: 2 Human VpreB amino acid sequence (GenBank Accession No = NP_009059.1) mswapvllmlfvyctgcgpqpvlhqppamssalgttirltctlrndhdigvysvywyqqrpghpprfllryfsqsdksqg pqvpprfsgskdvarnrgylsiselqpedeamyycamgarssekeerereweeemeptaartrvp SEQ ID NO: 3 Human VpreB3 nucleotide sequence (GenBank Accession No = NC_000022.10) cttcccagccctgtgccccaaagcacctggagcatatagccttgcagaacttctacttgcctgcctccctgcctctggcc atggcctgccggtgcctcagcttccttctgatggggaccttcctgtcaggtgaatctttcctgggcctcagtcacctggg tgtggggtggggaacagtactggccccagaaggcccctgcaaggaggcaaatcgatggggacagtagggcaggtcctggg aggggtattttttttttttttgagacggagttttgctcttgttgcccaggctggagtgcaacggtgcaatctcggctcac tgcacctctgcctcccaggttcaagcgattcttctgcctcagcgtcctgagtagctgggattacaggcatgcgccaccac gcctggctaattttgtatttttagtagagacaaggtttcttcatgttggtcaggctggtctcaaactcctgacctcaggt gatccgcccacctcagcttcccaaagtgctaggattacaggcttgagccactgcacctggctgggaggcgtattcaggta gaggggtgacctggcttcagacagctctgcccttgacctggggcaaatcacttccctgtgtgggccttggttttctcatt cgtgaatgaccagatcactagagccctggactctgactttgggtctccttcttagtttttcacaggggaagctattgtgg gttggtccctaccccacagggcctgaggcaatctaacctccctgagagggtccctgcagccagttgcctgaggctgagtt gatgtgtgggacagcccaggaggttcctgggggtggttagtctgtattcagggtttggaagagctgaagtgaagtgggcg gggaagtgggggagaggggtgcagttctgcagagaaacgtgggtgggtagcagagggacctagaggctgctagtccaacc ttctgagctctgggcctttaactgaacacagatccttgaggatatggcactaatggagatttgggggctaactccaaacc cctcactcacaaagggagatggaggtccagatagggcaccctaagtcacacagaaccaggcctcctgcaccccgttcatt gctaatcccatagcactgggctatgagccctttgggactgggagtctccatggagtccaaccaagccttcacagggcagg ggtgggagggaaggggctcaggctgagtgggtttgtgtctcgcagtttcccagacagtcctggcccagctggatgcactg ctggtcttcccaggccaagtggctcaactctcctgcacgctcagcccccagcacgtcaccatcagggactacggtgtgtc ctggtaccagcagcgggcaggcagtgcccctcgatatctcctctactaccgctcggaggaggatcaccaccggcctgctg acatccccgatcgattctcggcagccaaggatgaggcccacaatgcctgtgtcctcaccattagtcccgtgcagcctgaa gacgacgcggattactactgctctgttggctacggctttagtccctaggggtggggtgtgagatgggtgcctcccctctg cctcccatttctgcccctgaccttgggtcccttttaaactttctctgagccttgcttcccctctgtaaaatgggttaata atattcaacatgtcaacaaca SEQ ID NO: 4 Human VpreB3 amino acid sequence (GenBank Accession No = NP_037510) macrclsfllmgtflsvsqtvlaqldallvfpgqvaqlsctlspqhvtirdygvswyqqragsapryllyyrseedhhrp adipdrfsaakdeahnacvltispvqpeddadyycsvgygfsp SEQ ID NO: 5 Mouse VpreB1 nucleotide sequence (GenBank Accession No = NC_000082.5) agagcccagaaagcctgggagggtggtgagcaggaaccaggggtgcagtgaccctctccccaaagcagggaggagagtgc ttcccagctggtcagggcccaggagcagtggctgtagggggcagggtgctgcaggtctggagccatggcctggacgtctg tcctgctcatgctgctggcctatctcacaggtaaggaaactcttggggcccagggcttctttgctcctcctatggccttg ctctgccccagtgatggacatgcttctatcttctcaggttgtggccctcagcccatggtgcatcagccaccattagcatc ttcttcccttggagccaccatccgcctctcctgtaccctgagcaacgaccataacattggcatttacagcatttactggt accagcagaggccgggccaccctcccaggttcctgctgagatacttctcacactcagacaagcaccagggtcccgatatc ccacctcgcttctccgggtccaaagatacgaccaggaacctggggtatctgagcatctctgaactgcagcctgaggacga ggctgtgtattactgtgccgtggggctccggagccaggaaaagaagaggatggagagggagtgggaaggagaaaagtcgt atacagatttgggatcttaggctctggagacattcagaccctgaactgaagacagagtttgctttgctcggctagtctgg tatgggaaggaggggtagaacgtgaggttttgcagagcctagaagatggaattatgcagcttttccttgttctgcggtgt tgctatgagcccccattggaggctggattgtagaattaaagctgtttttactgaa SEQ ID NO: 6 Mouse VpreB1 amino acid sequence (GenBank Accession No = NP_058678) mawtsvllmllayltgcgpqpmvhqpplassslgatirlsctlsndhnigiysiywyqqrpghpprfllryfshsdkhqg pdipprfsgskdttrnlgylsiselqpedeavyycavglrsqekkrmerewegeksytdlgs SEQ ID NO: 7 Mouse VpreB2 nucleotide sequence (GenBank Accession No = NC_000082) atggcctggacgtctgtcctgctcatgctgctggcccacctcacaggtaagggaactcttggggtccagggcttccttgc tcctcctgtggccttgctctgccccagtgatggacatgcttctatcttctcaggttgtggccctcagcccatggtgcatc agccaccatcagcatcttcttcccttggagccaccatccgcctctcctgtaccctgagcaacgaccataacattggcatt tacagcatttactggtaccagcagaggccgggccaccctcccaggttcctgctgagatacttctcacactcagacaagca ccagggtcccgatatcccacctcgcttctccgggtccaaagatacggccaggaacctggggtatctgagcatctctgaac tgcagcctgaggacgaggctgtgtattactgcgctgtggggctccggagccacgaaaagaagagaatggagagagagtgg gaaggagaaaagtcgtatacagatttgggatcttag SEQ ID NO: 8 Mouse VpreB2 amino acid sequence (GenBank Accession No = NP_058679.1) mawtsvllmllahltgcgpqpmvhqppsassslgatirlsctlsndhnigiysiywyqqrpghpprfllryfshsdkhqg pdipprfsgskdtarnlgylsiselqpedeavyycavglrshekkrmerewegeksytdlgs SEQ ID NO: 9 Rat VpreB1 nucleotide sequence (GenBank Accession No = NM_001108845.1) ccagaaagcctgggagggtggtgagcaggaaccagtggtgcaaagcaggggcgagactgcttcctagctggtcagggcac cggagcagtggctgtaggggtcagggtgctgcaggtctggaaccatggcctggacgtctgccctgctcatactgctggcc catctcacaggtacgggaactcttggggcccagagcctccttgctcctcctcttgccttgctctgccgcagtgatgggca cgcttctattttctcaggttgtggccctcagcccgtgctgcatcagccaccatcggcctcttccttccttggaacctcca tccgcctcacctgtgccctgagcagcaaccataacattggcatttacagcatttactggtaccagcagaggccgggccac cctcccacgttcctgctgagattcttctcacactcagacaagctccagggtcccaagatcccccctcgcttctccggatc caaagatacagccaggaacctggggtacctgagcatctctgacctgcagccagaggacgaggctgtgtattactgcgccg tggggcttcggagctgggaaaaggagaagaggatggagagggagtgggaagaagaaaagtagcggacagattcgggatct taggctctggagacattcagacctagaaccgaagacggagtttgctttgctcggctaggctggtttggggaggaggggta gaacaccgggcttcgcagagccaggaaggtggagccagccgcttttccttgtattgcagtgttgctatgcgccccatcgg aggctggattgtagaattaaagctgttttttttttttttgttttgtttttttgttttttgtttcttttttcttaactg SEQ ID NO: 10 Rat VpreB1 amino acid sequence (GenBank Accession No = NP_001102315.1) mawtsallillahltgcgpqpvlhqppsassflgtsirltcalssnhnigiysiywyqqrpghpptfllrffshsdklqg pkipprfsgskdtarnlgylsisdlqpedeavyycavglrswekekrmereweeek SEQ ID NO: 11 Rat VpreB2 nucleotide sequence (GenBank Accession No = NC_005110) atggcctggacgtctgccctgctcatactgctggcccatctcacaggtacgggaactcttggggcccagagcctccttgc tcctcctcttgccttgctctgccgcagtgatgggcacgcttctattttctcaggttgtggccctcagcccgtgctgcatc agccaccatcggcctcttccttccttggaacctccatccgcctcacctgtgccctgagcagcaaccataacattggcatt tacagcatttactggtaccagcagaggccgggccaccctcccacgttcctgctgagattcttctcacactcagacaagct ccagggtcccaagatcccccctcgcttctccggatccaaagatatagccaggaacctggggtacctgagcatctctgacc tgcagccagaggacgaggctgtgtattactgcgccgtggggcttcggagctgggaaaaggagaagaggatggagagggag tgggaagaagaaaagtagcggacagattcgggatcttaggctctggagacattcagacctagaaccgaagacggagtttg ctttgctcggctaggctggtttggggaggaggggtagaacaccgggcttcgcagagccaggaaggtggagccagccgctt ttccttgtattgcagtgttgctatgcgccccatcggaggctggattgtagaattaaagctgttttttactg SEQ ID NO: 12 Rat VpreB2 amino acid sequence (GenBank Accession No = NP_001128260) mawtsallillahltgcgpqpvlhqppsassflgtsirltcalssnhnigiysiywyqqrpghpptfllrffshsdklqg pkipprfsgskdiarnlgylsisdlqpedeavyycavglrswekekrmereweeek SEQ ID NO: 13 Human λ5 nucleotide sequence (GenBank Accession No = NG_009791) ggccacatggactggggtgcaatgggacagctgctgccagcgagagggaccagggcaccactctctagggagcccacact gcaagtcaggccacaaggacctctgaccctgagggccgatgaggccagggacaggccaggggggccttgaggcccctggt gagccaggccccaacctcaggcagcgctggcccctgctgctgctgggtctggccgtggtaacccatggcctgctgcgccc aacagctgcatcgcagagcagggccctgggccctggagcccctggaggaagcagccggtccagcctgaggagccggtggg gcaggtaaggggtgagagattccaggggatgtgggggtctgggtggcagaggcgggaaaggatgaccaaggggagacgag ccagaggggtgaggaggaaggttaatccctggaggggagccacagacactgactttaactaaagtgtcaagattttgtcc atctttgaattaatttttattgcttaatgtcatattaaaatattatttatcttgattcctgagatttcttcccccactta catttggcaccaaggccaatgtctccctcacctccccctagtccttggggtagggcaggactggaggcaggggcaggacg tccacaggagtggtggccgctatccctgaaggatgcccaggcctctccctcctcctcctcccactcctcctcccccccct cctcttttccccttggcctatgtcacctgtccactcccaccctcactgggcaggggccactccctggagctccagctaag gtgtgaggggcctttcctggagtccctgggtcactagacctcagccagcatcgcctcctgaaaccagcccctaggagaca caagcttatccagggtgcaagtgcctccaaagaagaagcccaggagaggctctagggaggccacaactccctgtgtgacc tcagccattgccaccactctgcgtgtggtgggaggtccccagacaaagcaccaagcatcggggtccatttatgagcattt gggacacaacagcctgttcactggtgcatgttatacccacaggcgataatcatttcaggggcagaacctccctctcggtg gccccataggcaggtcctgctgtgatcccctttgtgcagacggggatagagcccggagaggtgaggtgacccgtccgagt cactcagctcatgggcacagattctaaggcccaagctatcccctctagctctcccctgtcccatcctctaagctgatcga gcggacacgtgcatctctgggacctgagtttccctttttctctcttttttttctttttcaaataaagtttcacagagttt cactcttgtcgcccagtctggagtgcaatggcgagatcttggctcattgaaacctccgtctcccaggttcaagacattct cctgcctcagcttccggagtagctgggattacaggcatctgccaccacacctggctaattttgtgtatttttttttagta aagacagagtttcaccatgttggccaggctggtctcaaactcctgacctcaggtgatccacccacctcagcctcccaaag tgctgggattacaggcatgagacaccacaccgggcctgagtttccccttctgcaatctgaggggccctgactggtgaggg ccttcagcgtcccacccacccagaggatgctggggtggctgtggtgagagctccagcagtggcagccgacctgacccaca ccaggagcccggccatggaggcggggtcagcatggtggcaggccgggaccgggtgtcagtgtcctgcacggacttctgag caaggagtccccatcagggtcaggctctgtgctggggctgaggtcccagaggatctagatttgccccaattcaagtccac aaggagcgggggccgggtgaggagacagccacatgcagggtgatgcctacagaacagagactgggatggggaaggcccga ggggtctccacaagggacgggtgacaggtggagggagacacagataataaaaaatggtattatgttggggggctattaat gtaagtttttatattagaatctttagaaatcttatagaaatactatgggccgggtgctgtgtctcatgcctgtaatccca gcactttaggacgccaagatgggcagatcacgaggtcaggagattgagaccatcctggccaacatggcaaaactccgtct ctactaaaaatacaaaaactagctgggcgtggtggcgcgagcctatagtcccagctactcgggaggggaggcaggagaat cgcttgaatcagggaggtggaggttgcagtgagctgagattgcaccgctgcactccagcctggacgacagaacgagactc ccactcaaaaaaaaaaaaaggtattatgctgggggggatatgaatatgagtttttataatctttagaaatactatgggcc aggtgcagtggctcatgcctgtaatcccacaactttgggaggctgaggcgggattgtttgagcccaggagtttgagatca gcctgagcaacatagcaagaccccatttctacaaaacatataaaaactagctagtcatggtggcacttgcctgtggtccc agctacttaggaggctggtgagaggattgcttaagcctcggaggttgaggatgcagtaagctgagatcccaccactgcac tccagcctgggtgacagaaggagaccctgtctcaaaaaaaaaaaaaaaaaaaaaagactgattattcctgtagaattctg gtaaatatctcctatcaaataaatgacttttctattcatagcttattaaaagatattttcattgtttttaagaaataggt tgtgtatcactttttatatttagttgtaaatttatttgttttatttatttttttagagatgggggtctcactatgttgcc tgggctggcctccaactgctgggctcaagcaatatccctgcctcagcgtccccagtagctgggactacaagcatgcgcta ccacaccggcataattttttgtagagatgaggtttcgccatgttgcctgggctggtcttgaacccctgggctcaagccat ccacccgcctcggcctcctaaagtgctgagattacagacgtgagccaccctacctagcctgtactatttttaataggtct taataggttttgaatgttaattatttttaaattaatttcaaaaatcttctacaacatggatgaaacctgaagacattata cttagtgaaataagccagacacaaaaggacaaatgtcatttgaatccacttctatgaggtacctagaataggcaaattca cttggacaaaaagtagatttgaggttagcagggtgagggggagggaagaatgggggactgtagttaacgggtttagagtt tctgtttgggaagatgaaagagttctggagatggatggtggtgaaggttgcccaacggtgtgaatgtacttagtgccacg gagctgtacgtttaaaaatagttaaagtggaaatattgatgctatgtataaaaatggagcggggtgtggtggctcacacc tataatcccagcactttgagaggccaaggtgggcagatcacctgaggtcgggagttcgagaccagcctgacaaacatgga gaaacaccgtctctactgaaaatacaaaaaattagccaagtgtgctggcacatgcctgtaatcccagctactcgggaggc tgaagcaggagaatcgcttgaacccaggatgtggaggttgcggtgagccaaggtggcaccattgcactccagcctgggcg acaagggtgaaattccatttcaaaaaaataaaaggaaatgggactgtacatagcaggagagagagggagagatcaatatg acacttctttttttttttttttttttttgggacagtcttgctctgttgccaggctgcactccagcctgggcaacagagcg agactctgtctcaaaaaaaaaaagaatgtctagatgcagtagctaagttcgcagaagccatttcagtgtgggaggcgagg tgactgagtccagggactccaggtttctggctgaggggttgagaccgcccatggtcattgtgatgagatgaagacagaaa atgagctgggctgggggaaggtggtcatctcctctggacgtgattagtttgaggctcctgttgggtagcccactgggcaa ggtcagtaggcaattgaggagctgagagggtttggagctgggatagactccagcctcaccatgtgggcaatagtgggggt cacagagtgtgaaaatggctaagaaagaggtctgggtggggtctggggagtcagcagccaggcatgggccatggagaaac agatgccaggggaagaggaaaggggagtctcagaccccaaggggaaaagagtcactggaaagaggggccagccctgtgtc gcatccagcggagacaccaggtacagcagaaggaccttggacatgaccatgaggagggccttgttgaccatggcctggga gagatgggggtgagagcctgggggaccacaccatgtccccagcacacagtgcctggtagacaggatggatttatggatgg acggacaggtagatggatggacgaatggacagatgatagatggatgcaaagacagatgaatagatggacagatgcataga tggacagatggacagatggatggacggacggatggaatgaatgatcagaaaaggcttcatgaacaaagtgagactgagct gcatctccatgggtagatataaaagcagaggactctcctcttgagtcaggaatgacccaatgtcctggtccagggaggaa gtcagcctccttgactggggacacttgtggcagatttcagaggcccttaaaatgaggccaagtgaggtggacaggtccga gccagctgaggactcctcagccacacggcacagctgcctgaggggatgtgtcactcagggagttgctgggacctactggg cccagcgttgccatcagcaccaacagtttcagagagggggacacacgctggggcagcacctgcctcagagaagggacagg cacagagacactactgggggacactactgggacactggccacccccctaccctgtgcctgggtcacagcctacacactgc agccctgtgcccctcactcccagcaggttcctgctccagcgcggctcctggactggccccaggtgctggccccgggggtt tcaatccaagcataactcagtgacgcatgtgtttggcagcgggacccagctcaccgttttaagtaagtggctctaacttc ccaggctgtcccaccctctcctgtctctggaaaatgtgttttctctctctggggcttcttcccctctgccctcccagcct taagcactgacccctacctttgtccatggggcctggaggagatgtgttagtctcagggtaatggcaggaagggcccccac agtgggagcagccgccttcaggttccaacagcaggacacagcctggtcccagggcctgggctgggattgggcggggtcag ggctcctcccctctcccagggcagatgtctgagtgagggacagaggctggttctgatgaggggccctgcagtggccttag agacagtccctgggaccccaggttctaggctgagggctggatgcccatccagcctgggagggccacacgggggcctgggg acacaggggtcacccccaggggagaccaatggagggcacagagagggctctgggtctaggctgcagctctgtggcctctg ctgggtcttcagggcatggggacacagaggaacggatgaggtcccagagcccagccctcccaggacagtcaccagaaagg agagggtctcttagtgcagagatgtgcctgtccctggagccctgtcatctctggggcctggtgtctctctgttcatgggt cgacctcccaccttcatttgaggaagggcaccttagactcagaaggtgactagcggggagtaaacgggagtgcagagaac tccatggctgccaggtgaagtccaggggcatcagaggctgctggggtgggcatgggggctgcggtgccccaaagtctggg ggagcagccccaagaacccagccgatgtgaagggtcctgtggtcgggctggtggggacaggggcgacggcagagccccag ggtgtgtctgggtggagcccacgcttcaccaggagagctgagtgggccaggctggggcacagcctggtgccccaggggat gggaagctccaggccatgccaggcttgggtctccccacatcctgccagtatagttttgtgtgctgtgagggagaccccta gattccaaactcagactccagaaaccaggaaggagggagcacagcctgccctgggtgcacacggggaaaccgaggctgca gaggaaagggctgggccaggacacctgggaaaggtgacttgggaagggctcctaggaaggcacagggctgtctgctctcc agagggctccagtggaaaggagggaatgaggagggaaggagaggccctgggtggaccaggcggccacaccatgaaccctc ccagagactttagacagagagaggcgctccacaacaccccacactccctctgccatctctcaccccctcctctgtccaca caggtcagcccaaggccaccccctcggtcactctgttcccgccgtcctctgaggagctccaagccaacaaggctacactg gtgtgtctcatgaatgacttttatccgggaatcttgacggtgacctggaaggcagatggtacccccatcacccagggcgt ggagatgaccacgccctccaaacagagcaacaacaagtacgcggccagcagctacctgagcctgacgcccgagcagtgga ggtcccgcagaagctacagctgccaggtcatgcacgaagggagcaccgtggagaagacggtggcccctgcagaatgttca taggttcccagccccgaccccacccaaaggggcctggagctgcaggatcccaggggaagggtctctctctgcatcccaag ccatccagcccttctccctgtacccagtaaaccctaaataaataccctctttgtcaaccagaaa SEQ ID NO: 14 Human λ5 amino acid sequence (GenBank Accession No = NP_064455) mrpgtgqggleapgepgpnlrqrwpllllglavvthgllrptaasqsralgpgapggssrsslrsrwgrfllqrgswtgp rcwprgfqskhnsythvfgsgtqltvlsqpkatpsytlfppsseelqankatlyclmndfypgiltytwkadgtpitqgv emttpskqsnnkyaassylsltpeqwrsrrsyscqvmhegstvektvapaecs SEQ ID NO: 15 Mouse λ5nucleotide sequence (GenBank Accession No = AC_000038) ctggaatagcttttggccaccagaggaggaacaatccttttgccgggagatctacactgcaagtgaggctagagttgact ttggacttgagggtcaatgaagctcagagtaggacagactctgggcactatccccaggcagtgtgaagttctcctcctgc tgctgctgttgggtctagtggatggtgtccaccacatactttccccaagctcagcagaaaggagcagagctgtgggccct ggagcttcagtgggaagcaacaggcctagcctatgggcccttcccggcaggtaagagacttgctttttggggaaggtacg cgtgtaggtccacggactagaggctagaatgagtgactgggaaggaaggtggctattgaggccatgggtgtgagaaggaa ggatctgcctaagaggagggggctgtgcaaatgctagcttaaacttggtacctcgatcattgcaaggccagtgttctacc aatgaaccacatccatagcccacctccattttccatttattttgagaagaggtcttgctaggtttcccaggctgatctta ctctctgtagcctatgcaagatttgaacgtcctatagctttcctagatccttcctccttcctaagtcctctttagtagct agaattaagactgtggtggcgcgtttgtttttgttttttgttttttgtttttttgtgtgtgtgtgtgtgtgtgtcatggt cctagttcatttagaattaaatgtgtgtgtttatcgtagtcaggtagttgcagttctagaagtttattcttaggtaatta ttaaaacatccattaaatattgtgagggtgtcttgggttgggaggaggagacaccatggccatggagatcacccagtcaa gaagaccttgcctgtaaagcatgaggacttaattcagactctcagcaaccattcatctactatcaagatgtttttactca gataaatagaaaaaaaaaaaaaaaaggtttggtgtggtggcacaccttttatatcccagcagttgggaggcagaggcagg cagatctttgtgagtttgaggccagcctggtctacatagtagtctcatgatagaaataaaaacaaaagaatccttgcaat atagtgtgcatgggtcctgggaggaagcatagggccagtaggaactgatgcacacacgcatgtatgcacacacgcacaag cacatatgcacacacaaatattcaccagagattcctggttccaccacacccaccagagaagatataccacccaggaattt ctgagaatctgctgggcctgatattgccatcaacactgaaaaaattcagagatggaagtaagagatggatagatggactt gtcgggatactggctgctcatctattttctgccaaggacatagtttatttcctgaagttctgtgtctgactcacccaaca ggctcctgttccagatcatcccacggggagcaggtcccaggtgctcgccccataggcttccatctaagccccagttttgg tatgtctttggtggtgggacccagctcacaatcctaggtaagtggttctcatggtctcatgatccagctggctcagggaa gtccatttttgctctggggaattcttactatctgcctttcctagccttgcagtctgaactgtaaaggcagtagtaattct aaggtaaatcacagggaaaggccctaacagcttcatctactcttctgaggcagatgcccaaaaagagatcaaggaatgag atttttcaggccatagagaccactacacctcagcatctaaaccgaggcccagatgcccatccagactaagaagaccacat agtaggtgcagggacatggctggtgccatgaaccccaccttcaaagattcaggccaccctagatagaaaaccacagaggc tgagagagaaatttggttcaatttaatcttctgctggcccatgaggtcacagagacacacaaaaggctcagagtgatcag gttactagaaccaggtcctcccaggatgattactagaaaagaaatgtagactgtctattgttcctgagggctggagcctg ttgtctaacttgtccatcttcctcaaatatccttagacttagatgaggaatgaaggagcaaatggggccaagtgaaatca ggagtaaccagagacttcctggatggtcaggggtttgctatccttcctagtctgggagatggagcctcaggaacacagcc agtataggtcttgtgattactgtggacaaagcagtaggtccttggaggagttggaggatttttctggatggaatctaatc ctcggtgagataactaaatggaatctggagcacaggccccgaagcccttatacagcaggcacacctcaagaccaactctc caggaccagtcttgcagaataaatgagcagcaattctcagaggagtctctgagcactggagaagcaattcaggttgggga gctgccctctgcctcaccagaaggccagggtcagatcccaattcactaccaggaggcctgggttagatcccaaatcaggt tccagcttcaaggggctagagaattcagctggtcttagtctcagcgggggaactgagattgcaagggtctgggtctgggt cattttatctggaagaggaacatgttctaatgggatgctaggctgtctgctctccaggggactcaagtggtcagaggaga agaaggaagcatccctggatggaagactgatgctgtagtgaatggccacagagctcctgataagagaaggacgcttcctt atcacgtgggctctcctatgctaactcttatctccttctctatctgcgcaggtcagcccaagtctgaccccttggtcact ctgttcctgccttccttaaagaatcttcaggccaacaaggccacactagtgtgtttggtgagcgaattctacccaggtac tttggtggtggactggaaggtagatggggtccctgtcactcagggtgtagagacaacccaaccctccaaacagaccaaca acaaatacatggtcagcagctacctgacactgatatctgaccagtggatgcctcacagtagatacagctgccgggtcact catgaaggaaacactgtggagaagagtgtgtcacctgctgagtgttcttagaccacaatcctccctgaagcctcaggggc ctggatctgaagtgccagaaaaagttgttttttgttttgttttttgttttttttcccattaaccatctcactgtctttcc tgtgcctaatactcaataaatatcttaccaccaac SEQ ID NO: 16 Mouse λ5amino acid sequence (GenBank Accession No = NP_001177254) mklrygqtlgtiprqcevlllllllglvdgvhhilspssaersravgpgasvgsnrpslwalpgrllfqiiprgagprcs phrlpskpqfwyvfgggtqltilgqpksdplvtlflpslknlqankatlvclvsefypgtlvvdwkvdgvpvtqgvettq pskqtnnkymvssyltlisdqwmphsryscrvthegntveksyspaecs SEQ ID NO: 17 Rat λ5nucleotide sequence (GenBank Accession No = NC_005110) atgaagctcagggcaggacagacactgggcactatccccaggcaatgtgaaattctccttctgttgctgctgttgggcct ggtggatggtgtccaccatatactttccccaagctcagcacaaaggggcagagctgtgggccctggagcctcagtgggaa gcagcaggtctagcctgtggacccttccaggcaggtaagagactttcttataggggaatgtatgtgtgtgggtccatgga ctggaggctgaaatgggtgactgggaaggaaggtaaccattgaggccataggtgtgagaaggaaggatctgcctaagagg agggggctgggcaaatgctagcttaaacttagtacctaactcatgcaaggccagtattctatcaatgagccatatccata gcccacctcccttttccatttatttagagaagagggcttgcctgattgcctaggctgatcttactttctgtagcctatgc aagatttgacctcctagatccttcctccttcctaagtcctctttagtagctggaattaaggctggtaccaggttatttta gtgtgtcatggtcctagttcattcagaattgtgtgtgctcatcttagtcaaatagttgcagttttagaagtttattctta ggcaattatcaaaacatccattaaatattctaagagtgtcttgggctgggaagagtagacactggggacatggagaccac ccagtgaagaagactttgactgtcaagcatgaggacctagttcagactctcagcactcatgcatctgctatcaagataca atcacatgtttttttactcagataaatagaaaaaaattaaggtttggtgtggtggcacatcccttatatcccagcagttg gcaggcagaggcagatagacctttgtgagtttgaggccagtctggtttacatagtggcctcatgacagaaataaaacaaa agaattctcacaaaatatagtgtgcatgggtcctgggatgaagcacaggaccaggaggaatacacacacacacacacaca cacacacacacacacacacacacacacacaaatacacatcagagattcctggttccaccacactcaccagagaagatata ccacccagaacttttctgagaatcagttgggcctggtgttgccatcaactggatggaaataggagatggatagatggact tattttctgccaaggacacagttcatttcctgaagtccggtgcctacctcacccaacaggttcctgttccagatcatccc acggggagcaggtcccaggtgctggccccataggcttccatccaagtcccagttatggtacgtctttggtagtgggaccc agctcacaatcctaggtaagcgattcccatggtctcatgatccagctgtcttagggaagtccctttttcctctggggagt tcttctcacctgcctttcctagtcttgtagtctgaatggtacctttttctgtgagtgaggggaaggcaagtagttctaag gtaaaccacaggaaaggtcccaatagcttcagctactcttccgaggcagatgtccaaaaagggatcaggggctgaggttt tcaggctgtagagaccactgcacttcagcatctaaactgaggcccagatgcccatccagattaagaaggccacatagtag gtgcagggatatgactggtgccatgattcctgcctttaaagattcaggtaaccctagatagaaaaccacagaggctgaga gaagaatctggcccagtttaatcttctgctggccaatgaggtcatagagacacagaaaaggcttagagtgaccaggtaac tagaaccagctcctcccaggatgattactagaaaaaaaatgtagactgtctattgttcctggggtttctcaggcctggag cctgttgtctaacttgtccatctccctcaaatgtccttagagttagatgaggaatgagggagcaaatggcgccaagtgaa atcaggagtcaccagagacttcctgggtggtcaggggtttgctatccttcctagtctgggagacggagccccaggagcac agtcagagtaggtcttgtgattactgtggacaaagcatcaggcccttggaggagttcctaggacttttcaggatggaatc taatccttggtgagataactaaatagaatctggagcacaggcccgggagcccttatacagcaggcacacctcaagaccaa ctccccaggaccagtcttgcagaataaatgagcagtaattctcagaggaggctctgacactggagaagcaatgggggttg gggagctgctccctgaactcccccacccccataggaggccagcgtcagatcccaattcagattccagctcctagtggcta gagagtacagatggccttggtcttagtggggaaactgagattgcaaggggcagggtgtgggtcatttcacctggaagagg aacacggtctaatggggcaccaggctgtctgctctccaggggactcaggtgggcagaggaaaagaaggaagcatccttga tggaacactctgagctgtagtgaatggctacagggctcctgataagaggaggatgcttccctgtcatgtgggctctccta tgccaactcttatccccttctctatctgcacagggcagcccaagtctgaccccttggtcactctgttcctgccttcctta aagaatctccaggttaagaaggcgacactagtgtgtctggtgagcgaattctacccaggtactttggtggtggactggaa ggtagatgggatccctgtcactcagggtgtggagacaacccaaccctccaaacagaccaacaacaagtacgtggccagca gctacctgacactgatgtctgaccaatggatgcctcacagtagatacacctgccaggtcactcatgaaggaaacactgtg gagaagagtgtgtcacctgctgaatgttcttag SEQ ID NO: 18 Rat λ5amino acid sequence (GenBank Accession No = NP_001177270) mklragqtlgtiprqceilllllllglvdgvhhilspssaqrgravgpgasvgssrsslwtlpgrflfqiiprgagprcw phrlpsksqlwyvfgsgtqltilgqpksdplvtlflpslknlqvkkatlvclvsefypgtlvvdwkvdgipvtqgvettq pskqtnnkyvassyltlmsdqwmphsrytcqythegntveksyspaecs SEQ ID NO: 19 5′ homology arm acctggccaaactgagcatgacctttgacctagccagctcttaaacttgttctgagatcacaaaccagccagaccaaatt SEQ ID NO: 20 3′ homology arm tcctcccagaatgcttccctgggtcaaacccagagccacaaaggcttccattagaccattctggtaagtgacagagtcac SEQ ID NO: 21 5′ homology arm AAAAAAAAAAAAGCCCAGCTAGCTTAGTTGGTAGAGCATGAGACTCTTAATCTCAGAGTCATGGGTTCAGGC CTCATGTTTGGCACCATCTATAGTGTGCAGTTATAAATCAAACAGTTCACGATGTGGCTGGCTAGGCACTGGC AACTGCAGTCTCACCTGCTCCCATGGTTCCCAGTTCCCACAGCTAGTTTGCTGCCAGGCTGTTCACACTTCCCA GGTCATCTACCCACTGTGGCAAGCCTGCAGAAAGCCTGCTATTGCTAGCTCAGATTCCCTTAGCCCTATAAAAT GATAACACCACAGACTTTTACTATACCATCCAGATTTATAACAGTAATTCTCCAACCC SEQ ID NO: 22 3′ homology arm CAGGAGTACACCAAAATGTCTAGCCAAAATTTTTATATATGATCACTTAAATAAGACTCCTTAACATAAACCTA CATGATATACCAAGTCTTTTCTGCCAAGGCTCTGACACTATAGTTTGTCCTATTCTGGAGTTAGGTAAGCAAAG GGCTATTTAGGTGTGGATTGCAAAGAGAGAATAGCAAGACAACCTGCCCATTCTTTGCCACACCTCACTAATC AGTGTCCCTTGGAAAGCACTGTAAATATGGAGGTTTCTTTTTGTATTATGTAGTAGTGGATTTAACTTGAGGAG CCCCAAAAGGGGTCAGCAAAGCATGGGAAATCTAAGAATTTAACACTTCAGTGACTTTTAATCACCTACAGAT CCAGGAAAATAAGCCTGTCTCTT 

What is claimed is:
 1. A method for producing an antibody specific for a selected antigen, said antibody comprising an immunoglobulin heavy (IgH) chain comprising a human IgH chain variable region, the method comprising isolating from a cultured cell expressing said IgH chain or fragment thereof comprising said human IgH chain variable region, one or more of the group consisting of: (i) said IgH chain or fragment thereof comprising said human IgH chain variable region; and (ii) nucleic acid encoding said IgH chain or fragment thereof comprising said human IgH chain variable region; wherein said human IgH chain variable region is of a transgenic mouse contacted with said selected antigen or epitope thereof, for a time and under conditions which permit the antigen-contacted mouse to produce antibody specific for said antigen; wherein said transgenic mouse comprises an IgH chain transgene, the IgH transgene comprising (a) one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region; the genome further comprising (i) a human VpreB gene capable of expressing a human VpreB, and (ii) a mouse or rat λ5 gene; wherein the transgenic mouse is capable of expressing a chimaeric surrogate light chain comprising human VpreB and mouse or rat λ5 for pairing with the heavy chain.
 2. The method of claim 1, wherein the constant region is a mouse constant region.
 3. The method of claim 1, wherein the constant region and the λ5 gene are mouse constant region and mouse λ5 gene, respectively.
 4. The method of claim 2, wherein the transgenic mouse is a strain selected from the group consisting of a 129-derived mouse strain, a C57Bl/6 derived mouse strain and a JM8 derived mouse strain.
 5. The method of claim 2, wherein the constant region is of a mouse 129 strain constant region or of a C57Bl/6 strain constant region, and the λ5 gene is a mouse 129 strain or a mouse C57Bl/6 strain.
 6. The method of claim 1, wherein the constant region gene and the λ5 gene are endogenous genes of said transgenic mouse.
 7. The method of claim 1, wherein the genome is homozygous for said transgene, human VpreB gene and mouse or rat λ5 gene.
 8. The method of claim 1, wherein endogenous antibody heavy chain expression in said mouse is inactive.
 9. The method of claim 1, wherein endogenous antibody light chain expression in said mouse is inactive.
 10. The method of claim 1, wherein the heavy chain transgene is devoid of a CHI gene segment and the genome comprises no functional antibody light chain locus.
 11. The method of claim 1, wherein the heavy chain transgene is devoid of a gamma CHI gene segment and the genome comprises no functional antibody light chain locus.
 12. The method of claim 1, wherein the heavy chain transgene is devoid of a mu CHI gene segment and the genome comprises no functional antibody light chain locus.
 13. The method of claim 1, wherein the constant region is a mu constant region.
 14. The method of claim 1, wherein the human VpreB gene has a nucleotide sequence that is at least 85% identical to SEQ ID NO:1.
 15. The method of claim 1, wherein said genome does not comprise a non-human vertebrate species VpreB1 and/or VpreB2 gene.
 16. The method of claim 1, wherein said genome comprises an insertion of human lambda V region or lambda VJ region comprising all of the V regions and intervening sequences, wherein the lambda V region or VJ region comprises a human VpreB gene.
 17. The method of claim 16, wherein non-functional V and/or J gene segments are omitted.
 18. The method of claim 16, wherein the lambda V region or VJ region comprises a human VpreB gene and its associated promoter.
 19. The method of claim 16, wherein the insertion is an insertion into an antibody light chain locus.
 20. The method of claim 16, wherein the insertion replaces endogenous Vλ in whole or in part.
 21. The method of claim 1, wherein the human VpreB gene is not within the endogenous VpreB-λ5 locus.
 22. The method of claim 1, wherein the expression of the human VpreB gene is under endogenous control.
 23. The method of claim 1, wherein the human VpreB gene is operably linked to one or more DNaseI hypersensitive sites.
 24. The method of claim 1, wherein the human VpreB gene is present in the genome in any one of 2, 3, 4, 5, 6, 7, 8, 9 or 10 copies.
 25. The method of claim 1, wherein all λ5 sequences in the genome are mouse λ5 sequences.
 26. The method of claim 22, wherein the human VpreB gene is operably linked to an endogenous promoter.
 27. The method of claim 1, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of use of heavy chain variable gene segments, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 28. The method of claim 1, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of use of heavy chain JH gene segments, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 29. The method of claim 1, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of one or more or all heavy chain variable region gene segments selected from the group consisting of VH6-1, VH1-3, VH3-7, VH1-8, VH3-9, VH3-11, JH1 and JH6, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 30. The method of claim 27, wherein use of one or more or all of VH6-1, VH1-3, VH3-9 and JH1 is higher in the repertoire of said mouse than in the repertoire of the control.
 31. The method of claim 27, wherein use of one or more or all of VH3-7, VH1-8, VH3-11 and JH6 is lower in the repertoire of said mouse than in the repertoire of the control.
 32. The method of claim 4, wherein said transgenic mouse is a C57Bl/6—129/Sv hybrid strain.
 33. The method of claim 13, wherein said mu constant region is an endogenous mu constant region.
 34. The method of claim 16, wherein said genome comprises an insertion of human lambda V region or lambda VJ region comprising all of the V regions and all the J regions and intervening sequences, wherein the lambda V region or VJ region comprises a human VpreB gene.
 35. The method of claim 19, where the insertion is an insertion into an endogenous lambda locus upstream of the endogenous lambda constant region.
 36. The method of claim 26, wherein said endogenous promoter is an endogenous VpreB promoter (e.g., a VpreB1 promoter).
 37. The method of claim 28, wherein the transgenic mouse expresses a repertoire of Ig heavy chain variable regions that differs by a probability of less than 0.05 in a binomial distribution test, from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of use of heavy chain JH gene segments, wherein the control and said transgenic mouse genomes are of the same background mouse strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 38. The method of claim 1, wherein the genome of said transgenic mouse comprises (a) said antibody heavy chain transgene; and (b) an antibody kappa light chain transgene and/or an antibody lambda chain transgene; wherein all of the V, D and J in said transgenes are human V, D and J; wherein endogenous antibody heavy and light chain expression is inactive.
 39. The method of claim 38, wherein the genome of said transgenic mouse is homozygous for said heavy and light chain transgenes.
 40. A method of promoting B cell development in a non-human vertebrate comprising an immunoglobulin heavy (IgH) chain comprising a human IgH chain variable region, wherein said non-human vertebrate is a mouse or a rat, wherein said non-human vertebrate comprises an IgH chain transgene, the IgH transgene comprising one or more human VH gene segments, one or more human D gene segments and one or more human JH gene segments operably connected upstream of a constant region gene so that the transgene is capable of undergoing VDJ recombination in vivo to produce an antibody gene comprising a rearranged VDJC encoding an antibody heavy chain having a human variable region and a constant region, optionally wherein the genome is homozygous for said IgH chain transgene; the method comprising: (i) inactivating endogenous heavy chain expression in said vertebrate; (ii) providing in the genome of said vertebrate: (a) a human VpreB gene capable of expressing a human VpreB, and (b) a mouse or rat λ5 gene; wherein the transgenic mouse is capable of expressing a chimaeric surrogate light chain comprising human VpreB and mouse or rat λ5, wherein the transgenic mouse is capable of expressing a pre-B-cell receptor comprising said chimaeric surrogate light chain and an IgH mu chain comprising said human variable region, wherein said pre-B-cell receptor promotes B cell development of cells bearing said IgH mu chain in said vertebrate.
 41. The method of claim 40, wherein said non-human vertebrate comprising an IgH chain transgene is a mouse, and wherein the constant region of said antibody heavy chain is a mouse constant region.
 42. The method of claim 40, wherein said non-human vertebrate comprising an IgH chain transgene is a mouse, and wherein the constant region of said antibody heavy chain and the λ5 gene are mouse constant region and mouse λ5 gene, respectively.
 43. The method of claim 41, wherein said mouse is a strain selected from the group consisting of a 129-derived mouse strain, a C57Bl/6 derived mouse strain and a JM8 derived mouse strain.
 44. The method of claim 41, wherein the constant region of said antibody heavy chain is of a mouse 129 strain constant region or of a C57Bl/6 strain constant region, and the λ5 gene is a mouse 129 strain or a mouse C57Bl/6 strain.
 45. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein the constant region gene and the λ5 gene are endogenous genes of said transgenic mouse.
 46. The method of claim 40, wherein the genome is homozygous for said transgene, human VpreB gene and mouse or rat λ5 gene.
 47. The method of claim 1, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein endogenous antibody heavy chain expression in said mouse is inactive.
 48. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein endogenous antibody light chain expression in said mouse is inactive.
 49. The method of claim 40, wherein the heavy chain transgene is devoid of a CHI gene segment and the genome comprises no functional antibody light chain locus.
 50. The method of claim 40, wherein the heavy chain transgene is devoid of a gamma CHI gene segment and the genome comprises no functional antibody light chain locus.
 51. The method of claim 40, wherein the heavy chain transgene is devoid of a mu CHI gene segment and the genome comprises no functional antibody light chain locus.
 52. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein the constant region of said antibody heavy chain is a mouse mu constant region.
 53. The method of claim 40, wherein the human VpreB gene has a nucleotide sequence that is at least 85% identical to SEQ ID NO:1.
 54. The method of claim 40, wherein said genome does not comprise a non-human vertebrate species VpreB1 and/or VpreB2 gene.
 55. The method of claim 40, wherein said genome comprises an insertion of human lambda V region or lambda VJ region comprising all of the V regions and intervening sequences, wherein the lambda V region or VJ region comprises a human VpreB gene.
 56. The method of claim 55, wherein non-functional V and/or J gene segments are omitted.
 57. The method of claim 55, wherein the lambda V region or VJ region comprises a human VpreB gene and its associated promoter.
 58. The method of claim 55, wherein the insertion is an insertion into an antibody light chain locus.
 59. The method of claim 55, wherein the insertion replaces endogenous Vλ in whole or in part.
 60. The method of claim 40, wherein the human VpreB gene is not within the endogenous VpreB-λ5 locus.
 61. The method of claim 40, wherein the expression of the human VpreB gene is under endogenous control.
 62. The method of claim 40, wherein the human VpreB gene is operably linked to one or more DNaseI hypersensitive sites.
 63. The method of claim 40, wherein the human VpreB gene is present in the genome in any one of 2, 3, 4, 5, 6, 7, 8 9 or 10 copies.
 64. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein all λ5 sequences in the genome are mouse λ5 sequences.
 65. The method of claim 61, wherein the human VpreB gene is operably linked to an endogenous promoter.
 66. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the heavy chain variable region repertoire of a control mouse in the proportion expressed as a percentage of use of heavy chain variable gene segments, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 67. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of use of heavy chain JH gene segments, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 68. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, wherein said transgenic mouse expresses a repertoire of Ig heavy chain variable regions that significantly differs from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of one or more or all heavy chain variable region gene segments selected from the group consisting of VH6-1, VH1-3, VH3-7, VH1-8, VH3-9, VH3-11, JH1 and JH6, wherein the control and said transgenic mouse genomes are of the same background strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 69. The method of claim 66, wherein use of one or more or all of VH6-1, VH1-3, VH3-9 and JH1 is higher in the repertoire of said mouse than in the repertoire of the control.
 70. The method of claim 66, wherein use of one or more or all of VH3-7, VH1-8, VH3-11 and JH6 is lower in the repertoire of said mouse than in the repertoire of the control.
 71. The method of claim 43, wherein said transgenic mouse is a C57Bl/6-129/Sv hybrid strain.
 72. The method of claim 52, wherein said mu constant region is an endogenous mu constant region.
 73. The method of claim 55, wherein said genome comprises an insertion of human lambda V region or lambda VJ region comprising all of the V regions and all the J regions and intervening sequences, wherein the lambda V region or VJ region comprises a human VpreB gene.
 74. The method of claim 58, where the insertion is an insertion into an endogenous lambda locus upstream of the endogenous lambda constant region.
 75. The method of claim 65, wherein said endogenous promoter is an endogenous VpreB promoter (e.g., a VpreB1 promoter).
 76. The method of claim 67, wherein the transgenic mouse expresses a repertoire of Ig heavy chain variable regions that differs by a probability of less than 0.05 in a binomial distribution test, from the IgH heavy chain variable region repertoire of a control vertebrate in the proportion expressed as a percentage of use of heavy chain JH gene segments, wherein the control and said transgenic mouse genomes are of the same background mouse strain and both comprise said antibody heavy chain transgene, the transgenes being identical in the control and said mouse, and wherein the control does not express a human VpreB.
 77. The method of claim 40, wherein said non-human vertebrate comprising said IgH chain transgene is a mouse, and wherein the genome of said transgenic mouse comprises (a) said antibody heavy chain transgene; and (b) an antibody kappa light chain transgene and/or an antibody lambda chain transgene; wherein all of the V, (D) and J in said transgenes are human V, (D) and J; wherein endogenous antibody heavy and light chain expression is inactive.
 78. The method of claim 77, wherein the genome of said transgenic mouse is homozygous for said heavy and light chain transgenes. 